ABSTRACT


Since overhead costs constitute a large percentage of 
total cost for aerospace contractors, it is important to be 
able to predict them accurately. The research performed in 
this thesis takes six government aerospace contractors and 
obtains regression models of their overhead costs that can 
be utilized for forecasting purposes. This method is prefer- 
able to some of the more commonly used methods because it 
estimates overhead costs directly, eliminating reliance upon 
predicted overhead rates. After the data were transformed 
to eliminate the effects of autocorrelation, excellent 
structural results were obtained for five of the six
aerospace contractors. A Monte Carlo simulation was performed
to compare various estimators of the autocorrelation. Two
of the estimators were found to be superior. These two
estimators are both two-stage estimators that are calculated
utilizing Wallis's test statistic for fourth-order
autocorrelation.


I. INTRODUCTION


The purpose of this thesis is to analyze the overhead 
costs of six government aerospace contractors, determine the 
best estimators of autocorrelation found in the residuals,
and then obtain regression models for the overhead costs.
The most frequently used method to estimate overhead costs
is to utilize estimated overhead rates. These rates are
applied to estimated labor hours or costs in several func- 
tional categories. Summing all of these category values 
gives total overhead. This method results in poor 
estimation in cases where the firm's output fluctuates 
significantly, as in the case of government aerospace
contractors. Consequently, it is unsatisfactory for use with 
these six aerospace contractors. Another approach, the one 
utilized in this thesis, is to estimate overhead costs 
directly, and hence eliminate reliance upon overhead rates. 

For aerospace contractors, overhead cost comprises 30 to 
50 percent of total cost. Therefore, it is important to be 
able to accurately predict these costs. It is also desirable
that the predictive model be simple to utilize, without
sacrificing predictive power, since personnel of various
statistical backgrounds may be required to use it (Boger,
1983).

The work performed in this thesis is a continuation of 
that presented in Boger (1984). The statistical methods and 
procedures used herein to obtain the predictive models have 
already been proven to give good, useful results. There are 
additions in two areas to the data used by Boger. First, 
data were obtained for two additional quarters, and,
secondly, data were obtained for one additional contractor.
The major extension on Boger, and the emphasis of this
thesis, is in determining the best method to use to estimate
autocorrelation. The focus of the analysis performed herein
is to derive a simple and efficient regression model for
overhead cost.


II. DATA SOURCES AND CHARACTERISTICS


The data were obtained from six government aerospace 
contractors. To maintain confidentiality, all references to 
specific contractors will be with the labels A through F. 
Prior to obtaining the data, a specific format for data 
collection was selected to ensure uniformity of data
categories among the contractors. The overhead cost data were then
collected on a quarterly basis for the selected categories
from each of the contractors, starting with the first
quarter of 1979 through the fourth quarter of 1984. The
data for the last two quarters of 1984 were unavailable for 
contractor E. The data for the last two quarters of 1984 
for contractor B were eliminated from further analysis 
because they were clearly outliers. In addition to the cost 
data, data pertaining to production and operating character- 
istics were obtained. 

There are three major categories, two of which are made 
up of several subcategories, which comprise total overhead 
costs. The first major category, labor-related costs, has
subcategories of indirect salaries and fringe benefits. The 
second major category, facilities costs, includes all 
facilities-related costs. The last major category, the mixed 
costs category, has three subcategories. These three subca- 
tegories are Independent Research and Development and Bid 
and Proposal (IR&D/B&P) costs, Electronic Data Processing 
(EDP) costs, and a subcategory that contains all other costs 
related to overhead. 

The cost data were then converted from current dollars 
to constant fourth quarter 1984 dollars. This conversion was 
accomplished using Bureau of Labor Statistics (BLS) and 
Gross National Product Deflator (GNPD) indices for the
appropriate categories. The labor-related costs were
converted using BLS SIC 372, the price index for production-
worker average hourly wages for the aircraft and parts
industry. In this case the only indices available were 
monthly indices. The monthly indices were then averaged to 
obtain quarterly values. The facilities costs were adjusted 
using the GNPD gross private domestic fixed nonresidential
investment index, published by the Bureau of Economic 
Analysis. The mixed costs were adjusted using the GNPD 
personal consumption services expenditure index. As with 
all indices, those used here are imperfect. They were 
selected because they should provide the best adjustments 
for inflation among all readily available and relevant 
indices. 
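
The two adjustments described above, averaging monthly index values into quarterly values and rescaling current dollars to a base quarter, can be sketched as follows. This is a minimal illustration, not the procedure actually coded for the thesis: the function names and the sample numbers are hypothetical, and the base quarter is assumed to be the last entry of the index series (fourth quarter 1984 in the text).

```python
def quarterly_from_monthly(monthly_index):
    """Average consecutive groups of three monthly index values."""
    if len(monthly_index) % 3 != 0:
        raise ValueError("expected a whole number of quarters")
    return [sum(monthly_index[i:i + 3]) / 3.0
            for i in range(0, len(monthly_index), 3)]


def to_constant_dollars(costs, index, base_position=-1):
    """Rescale current-dollar costs to the price level of the base quarter.

    Each cost is multiplied by (index of base quarter / index of its own quarter).
    """
    if len(costs) != len(index):
        raise ValueError("costs and index must have the same length")
    base = index[base_position]
    return [c * base / i for c, i in zip(costs, index)]


# Hypothetical numbers, for illustration only (not the thesis data).
monthly = [100.0, 101.0, 102.0, 110.0, 110.0, 110.0]
quarterly = quarterly_from_monthly(monthly)           # two quarterly index values
constant = to_constant_dollars([50.0, 55.0], quarterly)
```

The cost in the base quarter is left unchanged, while earlier costs are scaled up when the index has risen.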

As previously mentioned, data in different production 
and operational categories were also obtained. The only one 
used in this analysis was the direct labor personnel 
category. Due to the nature of these data, no adjustment was
necessary.


III. MODELING QUARTERLY OVERHEAD COSTS


A. PRESENCE OF AUTOCORRELATION

Whenever a statistical model is based upon time series 
data, as is the case herein, the residuals can be expected 
to exhibit some form of autocorrelation. Further, "the
shorter the periods of individual observations, the greater 
the likelihood of encountering autoregressive disturbances" 
(Kmenta, 1971, p.270). Thus the presence of autocorrelation 
is more likely with quarterly observations than when the 
data are smoothed by reporting annual values. The most 
common assumption about the form of the autocorrelation of 
the error terms is that the errors are first-order
autoregressive (Judge et al., 1985, p.275). This model is called
an AR(1) process and possesses the form

    $\varepsilon_t = \rho_1 \varepsilon_{t-1} + \upsilon_t$,    (3.1)

where $\varepsilon_t$ is the error term from a regression model
corresponding to the observation at time t. The $\rho_1$ term is the
coefficient of correlation between the related error terms,
$\varepsilon_t$ and $\varepsilon_{t-1}$, of lag 1. The $\upsilon_t$ are normally and independently
distributed random variables with mean 0 and constant
variance $\sigma_\upsilon^2$.

With autocorrelation the assumption that the error
terms, $\varepsilon_t$, are independent identically distributed normal
random variables with mean 0 and variance $\sigma^2$ is not true.
They are related to previous error terms and depend upon the
form of autocorrelation present. Secondly, when the
autocorrelation is AR(1), the variance of the error terms is
$\sigma_\upsilon^2 / (1 - \rho_1^2)$ (Kmenta, 1971, p.271).
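
The variance expression for the AR(1) error term can be recovered in a few lines. The sketch below assumes that the process is stationary ($|\rho_1| < 1$), so that the variance of the error term, written here as $\sigma_\varepsilon^2$ (a symbol introduced only for this illustration), is the same in every period, and that $\upsilon_t$ is independent of $\varepsilon_{t-1}$.

```latex
\begin{aligned}
\operatorname{Var}(\varepsilon_t)
  &= \rho_1^2 \operatorname{Var}(\varepsilon_{t-1}) + \operatorname{Var}(\upsilon_t) \\
\sigma_\varepsilon^2
  &= \rho_1^2 \sigma_\varepsilon^2 + \sigma_\upsilon^2 \\
\sigma_\varepsilon^2
  &= \frac{\sigma_\upsilon^2}{1 - \rho_1^2}
\end{aligned}
```

The same argument applied to a process with a single lag of four periods gives the same form with $\rho_4$ in place of $\rho_1$.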

When the error terms are autocorrelated the least
squares estimators of the regression coefficients are still
unbiased and consistent, but they are no longer efficient or
asymptotically efficient. The standard estimates of the
variances of the coefficients are also biased. When the
error terms are positively autocorrelated, as is most common
for economic time series data, these variance estimates will
be biased downward (Kmenta, 1971, pp.278-283). This will cause the
coefficient of determination, R-squared, and the t and
F-statistics to be exaggerated (Maddala, 1977, p.283). The
upward bias in each of these statistics leads to an
unwarranted confidence in the regression model.

The adverse effect of an overestimated value of
R-squared is obvious since the higher the R-squared value,
the better fit the model is assumed to give. An upwardly
biased t-statistic makes it easier to reject, for any given
level of significance, the null hypothesis that the
regression coefficient equals zero, thus again giving the false
impression of a more significant regression model. The
effect of an exaggerated F-statistic is similar to that of
the t-statistic. It is easier to reject the standard
compound null hypothesis that all regression coefficients
equal zero for the F-test.

So, in addition to obtaining unbiased estimates of the 
regression coefficients, it is equally important to obtain 
unbiased estimates of their standard errors. Only then can 
reliable statistics be obtained that can accurately assess 
the quality of the regression model. 

As the AR(1) model indicates, the error terms in one 
period are related to those occurring one period prior. The 
AR(1) process is common in yearly economic data. When the 
data observations are quarterly, a special form of the
fourth-order autoregressive, SAR(4), process will be present
instead of the standard AR(4) process (Wallis, 1972, p.618).
This special SAR(4) process has the form

    $\varepsilon_t = \rho_4 \varepsilon_{t-4} + \upsilon_t$.    (3.2)


With this form of autocorrelation the error terms are
related to those occurring in the corresponding quarters of
successive years. Using the notation of Box and Jenkins,
this is a seasonal model of order $(1,0,0) \times (0,0,0)_4$. The
period equals four since the data are quarterly and can be
expected to show seasonal effects within years (Box and
Jenkins, 1970). When this SAR(4) process is
present the variance of the error terms is $\sigma_\upsilon^2 / (1 - \rho_4^2)$
(Judge et al., 1985, p.298). This special SAR(4) process
should be distinguished from the general AR(4) process, which
possesses the form

    $\varepsilon_t = \rho_1 \varepsilon_{t-1} + \rho_2 \varepsilon_{t-2} + \rho_3 \varepsilon_{t-3} + \rho_4 \varepsilon_{t-4} + \upsilon_t$.    (3.3)

The SAR(4) process used in this analysis considers the
effect of the three previous quarters to be negligible
compared to that of the same quarter of the previous year
(Boger, 1983, p.16).

Time series plots of the dependent variable, total
overhead costs (Figures 3.1a and 3.1b), suggest that the Wallis
SAR(4) model is appropriate for this problem. As can be
seen, the data show a seasonal effect within years. The
data appear to follow a yearly cycle in which the quarterly 
values of successive years fall in the same relative posi- 
tion with respect to the remaining three quarters of their 
respective years. 

The time series plots of the independent variable 
(Figures 3.2a and 3.2b), direct labor personnel, show that 
they all appear to follow some type of cycle or general 
trend. Contractors C, E, and F all appear to go through a
single cycle. They start with an upward trend for
approximately eight quarters, followed by a gradual decline for
about twelve quarters, and end with the start of another
upward swing. Contractor A appears to run through two
shorter cycles. Contractors B and D both appear to follow a
gradual, steady upward trend.

Figures 3.1a and 3.1b Time Series Plots of the Dependent Variable.
Figure 3.2a Time Series Plots of the Independent Variable.
Figure 3.2b Time Series Plots of the Independent Variable.


B. THEORETICAL MODEL UTILIZED
The general model utilized for overhead costs in this
analysis is of the form

    $Y_t = X_t \beta + \varepsilon_t$,    (3.4)

    $\varepsilon_t = \rho_i \varepsilon_{t-i} + \upsilon_t$,    (3.5)

where $X_t$ is a Tx2 matrix and $\beta$ is a 2x1 vector. $Y_t$ is the
dependent variable, total overhead costs. The columns of $X_t$
are the independent variable, direct labor personnel,
preceded by a column of 1's for the constant term. Only one
independent variable is utilized because it satisfactorily
explains the dependent variable and minimizes the complexity
of the model. The error term has the structure shown in
equation (3.5) where i is 1 for an AR(1) process, or 4 for
Wallis's special SAR(4) process. The $\upsilon_t$ are normally and
independently distributed random variables with mean 0 and
constant variance.

As previously mentioned, when autocorrelation is present
the estimators of the regression coefficients are not
efficient and the estimates of their variances are biased. If
$\rho$ is a known quantity, then the X and Y variables can be
transformed to eliminate the effect of the autocorrelation. A
regression of these new transformed variables, the Generalized
Least Squares (GLS) solution, yields results that correct these
deficiencies. However, $\rho$ is seldom, if ever, known. This
difficulty can be overcome by estimating $\rho$ from the data.
This yields the Estimated GLS (EGLS) solution that
possesses the desired properties (Kmenta, 1971, p.284).


C. ESTIMATORS OF FOURTH-ORDER AUTOCORRELATION

One of the major thrusts of this analysis is to arrive 
at the most efficient estimate of $\rho_4$. All of the estimators
considered fall into one of two categories, iterative or
maximum likelihood procedures. All but one of the iterative
estimators can be more specifically classified as two-stage
estimators. The theory behind maximum likelihood estimation
can be found in most intermediate level probability and
statistics textbooks. The basic approach to performing the
iterative, including the more specific two-stage, procedure
is presented in Kmenta (1971, p.288). The first seven esti- 
mators are two-stage estimators, while the eighth estimator 
utilizes the full iterative procedure. The ninth estimator 
is the maximum likelihood estimator. 

Since the AR(1) process is the most frequently
encountered form of autocorrelation, most of the estimators of $\rho_4$
used herein are simply adaptations of their corresponding $\rho_1$
estimator. Judge et al. (1985) derive six estimators for
the AR(1) process case. Adaptations of the estimators of
Judge et al. are given below.

The first estimator of $\rho_4$ is the standard Prais-Winsten
estimator

    $\hat{\rho}_4 = \sum_{t=5}^{T} e_t e_{t-4} / \sum_{t=1}^{T} e_t^2$,    (3.6)

where $e_t = Y_t - X_t \hat{\beta}$, the residual from the OLS regression in
equation (3.4). This estimator is simply the sample
correlation coefficient when the residuals possess the
autocorrelation process shown in equation (3.5) for i=4. The
AR(1) process version of this estimator is known to give a
downwardly biased estimate of $\rho_1$ (Park and Mitchell, 1980).
So we could expect this same result when using
equation (3.6) to estimate $\rho_4$.


The second estimator of $\rho_4$ is

    [Equation (3.7) is not legible in the source.]

where T is the number of observations and K the number of
parameters that must be estimated, in this case 2. This is
simply a modification to equation (3.6) derived by Theil to
incorporate a degrees-of-freedom correction. As can be seen,
it will further increase the downward bias of equation
(3.6).

The third estimator of $\rho_4$ is

    $\hat{\rho}_4 = 1 - .5 d_4$,    (3.8)

where $d_4$ is the Wallis test statistic for the special SAR(4)
process. As presented in Wallis (1972) the equation for $d_4$
is

    $d_4 = \sum_{t=5}^{T} (e_t - e_{t-4})^2 / \sum_{t=1}^{T} e_t^2$.    (3.9)

So equation (3.8) is easily computed once the test
statistic, $d_4$, has been calculated.

The fourth estimator of $\rho_4$ is

    $\hat{\rho}_4 = \frac{T^2 (1 - .5 d_4) + K^2}{T^2 - K^2}$.    (3.10)

This estimator is a modification to equation (3.8) derived
by Theil and Nagar. It is an improvement over equation (3.8)
if the explanatory variables are smooth, in particular when
the first and second differences of each explanatory
variable are small in absolute value compared to the range of
the variable itself (Intriligator, 1978).
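
The Theil and Nagar modification can be sketched as a one-line adjustment of the Wallis-based estimate. The function below assumes the standard Theil-Nagar form, $[T^2(1 - .5 d_4) + K^2] / (T^2 - K^2)$, and takes an already computed $d_4$ as input; the function and parameter names are illustrative.

```python
def theil_nagar_rho4(d4, n_obs, n_params=2):
    """Theil-Nagar style estimate: [T^2 (1 - d4/2) + K^2] / (T^2 - K^2)."""
    if n_obs <= n_params:
        raise ValueError("need more observations than parameters")
    t_sq = float(n_obs) ** 2
    k_sq = float(n_params) ** 2
    return (t_sq * (1.0 - 0.5 * d4) + k_sq) / (t_sq - k_sq)


# With T = 24 and K = 2 the correction is small but always upward
# whenever the unadjusted estimate 1 - d4/2 is above -1.
adjusted = theil_nagar_rho4(1.0, 24)
unadjusted = 1.0 - 0.5 * 1.0
```

For the sample sizes in this analysis (T of 22 or 24, K of 2), the adjustment shifts the estimate by roughly one hundredth.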

The fifth estimator of $\rho_4$ is the Durbin estimator. It is
the coefficient of $Y_{t-4}$ obtained in a regression of the form

    $Y_t = \rho_4 Y_{t-4} + \beta_1 (1 - \rho_4) + \beta_2 X_t - \rho_4 \beta_2 X_{t-4} + \upsilon_t$,  $t = 5, 6, \ldots, T$.    (3.11)

The sixth estimator of $\rho_4$ is obtained from an OLS
regression of $e_t$ on $e_{t-4}$:

    $e_t = \rho_4 e_{t-4} + \upsilon_t$,  $t = 5, 6, \ldots, T$.    (3.12)


The seventh estimator of $\rho_4$ is an adaptation of the Park
and Mitchell (1980) estimator:

    $\hat{\rho}_4 = \sum_{t=5}^{T} e_t e_{t-4} / \sum_{t=5}^{T-1} e_t^2$.    (3.13)

As can be seen, this is a modification of equation (3.6)
where the summation in the denominator excludes the first
four and the last observation. This will reduce the downward
bias associated with equation (3.6).
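
The residual-based estimators one, six, and seven share the same numerator and differ only in which squared residuals enter the denominator. The sketch below makes that comparison explicit. It is an illustration under stated assumptions: the denominator for estimator one is taken to be the sum over all T residuals, and the denominator for estimator seven follows the verbal description in the text (first four and last observation excluded). The function names and sample residuals are hypothetical.

```python
def _cross_sum(e):
    """sum_{t=5}^{T} e_t * e_{t-4}; index 4 is t = 5 in the text."""
    return sum(e[t] * e[t - 4] for t in range(4, len(e)))


def rho4_sample_correlation(e):
    """Estimator one: denominator uses all T squared residuals."""
    return _cross_sum(e) / sum(v * v for v in e)


def rho4_ols_on_lag(e):
    """Estimator six: OLS slope of e_t on e_{t-4} with no intercept,
    so the denominator sums the squared lagged residuals e_1..e_{T-4}."""
    return _cross_sum(e) / sum(v * v for v in e[:-4])


def rho4_park_mitchell(e):
    """Estimator seven as described in the text: the denominator drops
    the first four and the last residual (e_5..e_{T-1})."""
    return _cross_sum(e) / sum(v * v for v in e[4:-1])


# Small illustrative residual series (not thesis data).
resid = [2, -1, 1, -2, 1, -1, 1, -1, 1, 0, 1, -1]
est_one = rho4_sample_correlation(resid)
est_six = rho4_ols_on_lag(resid)
est_seven = rho4_park_mitchell(resid)
```

Because the denominator of estimator seven omits terms that estimator one includes, estimator seven is never smaller in absolute value, and in a short series such as this one it can exceed 1.0.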


The eighth estimator of $\rho_4$ is the iterative
Prais-Winsten estimator utilizing equation (3.13). This is the
same as estimator seven except that the iterative procedure
is carried out until convergence is achieved. For this
estimator, as well as the maximum likelihood estimator that
follows, convergence was defined to be consecutive estimates
within .00001 of each other. Each procedure was defined to
be nonconvergent if convergence had not occurred within
fifty-two iterations (Park and Mitchell, 1979, pp.2-6).

The maximum likelihood estimator, the ninth estimator
of $\rho_4$, was obtained using the iterative algorithm derived by
Beach and MacKinnon (1978). The specific procedure they
derived was for the AR(1) process, so again minor
adjustments had to be made to tailor it to the SAR(4) process. The
only alterations necessary were in the calculation of the
coefficients of the polynomial

    $f(\rho) = \rho^3 + A \rho^2 + B \rho + C = 0$.    (3.14)

The coefficients A, B, and C are now computed as ratios that
share a common denominator, DENOM, in equations (3.15)
through (3.18).

    [Equations (3.15) through (3.18) are not legible in the source.]

The remainder of the algorithm is the same as that presented
in Beach and MacKinnon (1978). Beach and MacKinnon
report that this algorithm should converge in four to seven
iterations for five digit accuracy.


D. TRANSFORMATION FOR AUTOCORRELATION 

If first or fourth-order autocorrelation is shown to
exist in the residuals, then the data must be transformed to
eliminate its presence. The transformation used for an
AR(1) process is discussed in Judge et al. (1985).
It should be noted here that only one estimator for $\rho_1$ was
used, the two-stage Prais-Winsten estimator derived in Park
and Mitchell (1980):

    $\hat{\rho}_1 = \sum_{t=2}^{T} e_t e_{t-1} / \sum_{t=2}^{T-1} e_t^2$.    (3.19)

This is the AR(1) process version of equation (3.13). This
particular estimator was selected because it was found to
perform better than any other commonly used alternative,
including the more standard Prais-Winsten estimator (see
Judge et al., 1985, p.286), in Park and Mitchell's 1980
analysis.


For the SAR(4) process the transformation is

    $Z_t^* = (1 - \rho_4^2)^{1/2} Z_t$,  $t = 1, 2, 3, 4$,    (3.20)

    $Z_t^* = Z_t - \rho_4 Z_{t-4}$,  $t = 5, 6, \ldots, T$,    (3.21)

where $\rho_4$ must be estimated using any of its known
estimators. Some estimators, of course, are more efficient than
others. This is one item to be resolved for the specific
models used herein.

Note that these transformations utilize all T
observations. An alternative approach is to omit a number
of the initial observations in the transformation (see
Cochrane and Orcutt, 1949, p.35). The number of omissions is
dependent upon the type of autocorrelation found present in
the data. The first observation is omitted when the data are
being transformed to eliminate the presence of an AR(1)
process. The transformation for an SAR(4) process would
result in the omission of the first four observations.
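
The SAR(4) transformation that retains all T observations can be sketched as below: the first four values are scaled by $(1 - \rho_4^2)^{1/2}$ and each later value has $\rho_4$ times the value four periods earlier subtracted from it. The function name is illustrative, and the sketch assumes the same transformation is applied to the dependent variable and to every column of the regressor matrix, including the column of 1's.

```python
import math


def sar4_transform(series, rho4):
    """Scale the first four values and quasi-difference the rest at lag 4."""
    if abs(rho4) >= 1.0:
        raise ValueError("rho4 must lie strictly between -1 and 1")
    z = list(series)
    scale = math.sqrt(1.0 - rho4 ** 2)
    head = [scale * v for v in z[:4]]                       # t = 1, ..., 4
    tail = [z[t] - rho4 * z[t - 4] for t in range(4, len(z))]  # t = 5, ..., T
    return head + tail


# Illustrative use on a short series and on the constant column.
y_star = sar4_transform([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], 0.6)
const_star = sar4_transform([1.0] * 6, 0.6)
```

Dropping `head` from the returned list would give the alternative that omits the first four observations.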

It has been shown that the use of all T observations
generally results in much more efficient estimates (Spitzer,
1979, and Park and Mitchell, 1980). It should be noted that
these results were arrived at by studying the AR(1) process
transformation where only the first observation is involved.
The effects should be even more dramatic for the SAR(4)
process transformation, since the first four observations
are involved. It is also worth noting that the relative
efficiency of these two alternative transformations is
related to the specification of the independent variable
(Maeshiro, 1979, and Taylor, 1981). Maeshiro found that in
the case where the independent variable is trended and $\rho > 0$
(as is most commonly the case with economic data) the
retention of all T observations greatly increased the efficiency
of the estimator. He also found that retention of the
initial observations was not as critical for the case of an
untrended independent variable.


E. PROCEDURE

The general procedure followed herein was to first 
perform an Ordinary Least Squares (OLS) regression with 
direct labor personnel as the independent variable and total 
overhead cost as the dependent variable. Direct labor 
personnel was selected as the independent variable because 
it was shown to perform the best among numerous explanatory
variables in a single variable regression with total
overhead cost in Boger's 1983 analysis. The residuals were
then analyzed and tested for the presence of Wallis's
special SAR(4) process, the AR(1) process, or a combination
of both. To do this, the autocorrelation function of the
residuals was first examined to get an overall picture of
the type of autocorrelation present. More formal testing 
was then performed. 

The Durbin-Watson test was used to check for the
presence of the AR(1) process (see Kmenta, 1971, pp.295-296).
The Wallis test, a generalization of the Durbin-Watson test,
was utilized to check for the presence of the SAR(4) process
(see Wallis, 1972, pp.624-625). In both cases a two-sided
test was performed using the null hypothesis $H_0$: $\rho = 0$, versus
the alternative $H_1$: $\rho \neq 0$. A significance level of size $\alpha$
was used to define the critical region.

One problem with both of these tests is the inconclusive
region between the upper and lower significance points that
determines the critical region. The size of this
inconclusive region increases as the sample size decreases or as the
number of regressors increases (Wallis, 1972, p.625). So in
this analysis we are handicapped by the small sample size,
but it is to our advantage here in keeping the number of
regressors to a minimum. Maddala (1977, pp.285-286) presents
numerous suggestions, derived by others, in dealing with
this inconclusive region for the Durbin-Watson test. In this
analysis we chose the statistically conservative approach of
ignoring the lower significance point and using only the
upper point to define the critical region. This rule was
followed for both the Durbin-Watson and Wallis tests. This
method is said to perform well in many situations for the
Durbin-Watson test (Draper and Smith, 1981, p.167). As
presented in Draper and Smith (1981) the rejection criteria
for the two-sided test for this rule are as follows: if $d < d_U$
or $4 - d < d_U$, reject $H_0$ at level $2\alpha$. Any point that would have
fallen in the inconclusive region before would now fall in
the critical region and lead to the rejection of our null
hypothesis. This treatment of the inconclusive region is
also recommended by Hannan and Terrell (1968). They feel that
the upper significance point is a good approximation for the
bound on the critical region when the regressors are slowly
changing. They further state that economic time series, as
is the case here, are slowly changing so the upper
significance point can be used as the lone significance point for
the Durbin-Watson test (Maddala, 1977, p.285). In another
study, Theil and Nagar (1961) computed significance points
for the Von Neumann ratio of least-squares estimated
regression disturbances, which is equivalent to the Durbin-Watson
test statistic. Their calculated significance points were
very close to the upper significance points for the
Durbin-Watson test. So this also gives added credence to
using the upper point as the sole significance point in
performing these tests. Though all of the referenced
results apply to the Durbin-Watson test, this rule was used
on the Wallis test also because, as previously mentioned,
the Wallis test is a slight modification of the Durbin-Watson
test.
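
The decision rule that uses only the upper significance point can be written as a short predicate. In the sketch below, `d` stands for either the Durbin-Watson or the Wallis statistic and `d_upper` for the tabulated upper significance point; both names are illustrative, and the numerical value of `d_upper` in the example is hypothetical rather than taken from any table.

```python
def reject_no_autocorrelation(d, d_upper):
    """Two-sided rule with the inconclusive region folded into the
    critical region: reject H0 when d < d_U or 4 - d < d_U."""
    return d < d_upper or (4.0 - d) < d_upper


# Hypothetical upper point, for illustration only.
D_UPPER = 1.4
low = reject_no_autocorrelation(1.0, D_UPPER)    # evidence of positive autocorrelation
mid = reject_no_autocorrelation(2.0, D_UPPER)    # no evidence against H0
high = reject_no_autocorrelation(3.0, D_UPPER)   # evidence of negative autocorrelation
```

Because the lower point is ignored, any statistic that would have landed in the inconclusive region is treated as a rejection.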

Next, depending upon the form of autocorrelation found
present, the data were transformed using the appropriate
transformation. The EGLS solution was then obtained by
reestimating the model using the transformed dependent and
independent variables. Again the residuals of this
regression were tested for the presence of autocorrelation. This
cycle of reestimating the model and testing for
autocorrelation was performed until a model was obtained where the
residuals were free from any autoregressive process. Once
this final model was obtained, the residuals were checked to
ensure that they were independent, identically distributed,
normal random variables with zero mean and constant
variance. In all cases this requirement was met. So the
final models met all of the necessary assumptions required
of a correct, reliable regression model.


IV. SMALL-SAMPLE PROPERTIES OF SEVERAL ESTIMATORS OF SAR(4)


A. GENERAL 

As previously mentioned, one of the purposes of this 
thesis is to determine the best estimators of fourth-order 
autocorrelation. This is important because the performance 
of the EGLS regression is dependent upon the quality of the 
estimator (Rao and Griliches, 1969, p.258). In order to
evaluate the nine estimators presented in Chapter 3, a Monte
Carlo simulation was carried out to determine their relative
performances. This chapter explains the simulation and
presents the results. The three estimators that performed
the best in this simulation were then used to obtain
structural models for each contractor. These three models then
provided another basis of comparison for the three final
estimators. The computer programs utilized in the
simulation are contained in the appendix.


B. RESULTS FROM PREVIOUS MONTE CARLO ANALYSIS 

No previous simulations comparing estimators of fourth- 
order autocorrelation could be found. Therefore the results 
of this simulation could only be compared with results 
obtained from previous simulations that evaluated various 
estimators of first-order autocorrelation. Even though 
these past Monte Carlo simulations dealt with the AR(1),
instead of SAR(4), process estimators, their results should
still be useful in predicting the relative performance of
the various estimators of $\rho_4$. Of the nine estimators
evaluated in this thesis, results comparing the AR(1) process
versions of only estimators one, five, seven, eight and nine
could be found. As will be shown later in this chapter,
estimators three and four proved to be the best of the nine
estimators tested herein. It would have been interesting to
see how their first-order autocorrelation versions would
have compared in these previous studies.

The Spitzer and Rao and Griliches studies compared the
Durbin and standard Prais-Winsten estimators of first-order
autocorrelation for Mean Squared Error (MSE) of $\beta_2$. In both
analyses the Prais-Winsten estimator performed better for
lower positive values of $\rho_1$, while the Durbin estimator
dominated for the higher values. The Spitzer study also
included the maximum likelihood estimator and showed that it
was better than both the Durbin and Prais-Winsten estimators
for $\rho_1 > .6$ for MSE of $\beta_2$, and better than the Durbin
estimator for $\rho_1 > .3$ for MSE of $\rho_1$ (it wasn't compared to the
Prais-Winsten estimator for MSE of $\rho_1$). Park and Mitchell
compared four estimators in their 1980 analysis for RMSE of
$\beta_1$ and $\beta_2$. The estimators they analyzed were the iterative
Prais-Winsten, the maximum likelihood, their version of the
Prais-Winsten (equation (3.19)), and the standard
Prais-Winsten estimators. Their maximum likelihood estimator
was computed utilizing Beach and MacKinnon's algorithm. The
iterative Prais-Winsten estimator was the best of the four
with a slight edge over the maximum likelihood estimator.
Since the iterative Prais-Winsten estimator outperformed its
two-stage counterparts it was shown that iteration leads to
a more efficient estimator. Of the two-stage estimators,
their version of the Prais-Winsten estimator was better than
the standard version. Park and Mitchell's 1979 study showed
that the iterative Prais-Winsten was also better than the
maximum likelihood estimator for MSE of $\rho_1$. All of these
studies showed that all of the estimators outperformed the
OLS solution when a significant amount of autocorrelation
was present in the residuals ($\rho_1 > .2$).


C. THE MODEL


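The model utilized in the simulation was

    $Y_t = \beta_1 + \beta_2 X_t + \varepsilon_t$,    (4.1)

    $\varepsilon_t = \rho_4 \varepsilon_{t-4} + \upsilon_t$,    (4.2)

where $\varepsilon_t \sim N(0, \sigma_\upsilon^2 / (1 - \rho_4^2))$, $\upsilon_t \sim N(0, \sigma_\upsilon^2)$, $t = 1, 2, \ldots, T$, and
$|\rho_4| < 1.0$. Three different independent variables were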

utilized in the simulation. They were the direct labor 


personnel for contractors A, B, and E. These were selected 
because they were three of the models where the SAR(4) 
process was found to be the most significant form of auto- 
correlation present in the residuals. A separate simulation 
run was performed for each of these so that the relative 
performance of the estimators could be compared for
independent variables with different structures. The value of $\sigma_\upsilon$
was particular to the contractor for which the simulation
run was performed.

It was desired that the generated dependent variable,
$Y_t$, be comparable in value and structure to the total
overhead cost for the respective contractor (the dependent
variable, $Y_t$, in equation (3.4)). Therefore the $\upsilon_t$ terms of
equation (4.2) had to be proportional to the $\upsilon_t$ terms of
equation (3.5). To accomplish this, the value of $\sigma_\upsilon^2$ was the
variance of the residuals, $\upsilon_t$, obtained from the OLS
regression of $e_t$ on $e_{t-4}$:

    $e_t = \rho_4 e_{t-4} + \upsilon_t$.    (4.3)

The variables $e_t$ and $e_{t-4}$ were the (unlagged and lagged)
residuals obtained from the OLS regression in equation
(3.4). The sample size, T, was simply the number of data
observations for each contractor, twenty-four for contractor
A and twenty-two for contractors B and E. Each simulation
was replicated one hundred times.


D. DATA GENERATION

For each simulation the independent variable, $X_t$, the
regression coefficients, $\beta_1$ and $\beta_2$, and $\rho_4$ were
predetermined, fixed values. As previously mentioned the three
different independent variables were the direct labor
personnel for contractors A, B, and E. Each simulation was
run for a range of values of $\rho_4$ that included .3, .4, .5, .6,
.7, and .95. The value of $\rho_4$ was restricted to positive values
because this is the region most likely to be encountered for
the SAR(4) process with economic data. As in the simulation
performed by Rao and Griliches in 1969, the constant term,
$\beta_1$, was set at zero. The value of the slope, $\beta_2$, was then
set at a value that generated dependent variables, $Y_t$,
proportional in size to the respective contractor's total
overhead cost. The value of $\beta_2$ for each simulation is
contained in Table 1.


TABLE 1
SIMULATION PARAMETERS

Simulation run    $\beta_2$    $\sigma_\upsilon$
Contractor A      (values not legible in the source)
Contractor B      (values not legible in the source)
Contractor E      (values not legible in the source)

With the ultimate goal of determining $Y_t$, the data were
generated using the following algorithm:

(1) Thirty-two $\upsilon_t$ values (thirty for contractors B and E)
were randomly generated from a normal distribution with
mean 0 and variance $\sigma_\upsilon^2$. The value of the standard
deviation, $\sigma_\upsilon$, for each simulation is shown in Table 1.
New $\upsilon_t$ values were drawn for each of the 100 replications.



(2) The first four values of $\varepsilon_t$ were computed to be

$$\varepsilon_t = \frac{v_t}{(1-\rho_4^2)^{1/2}}, \qquad t = 1, 2, 3, 4. \tag{4.4}$$

This generated error terms from the normal distribution with
mean 0 and variance $\sigma_v^2/(1-\rho_4^2)$.

(3) The next twenty-eight (twenty-four for contractors B and
E) values of $\varepsilon_t$ were generated from

$$\varepsilon_t = \rho_4 \varepsilon_{t-4} + v_t, \qquad t = 5, 6, \ldots, T+8. \tag{4.5}$$

A total of T + 8 values of $\varepsilon_t$ were generated. The first
eight of these values were dropped so that the first four
values, generated by step (2), didn't excessively influence
the sample (Spitzer, 1979, p.46). This left T
values of $\varepsilon_t$ that follow the SAR(4) process shown in
equation (4.2).


(4) Using $\beta_1 = 0$, the respective $\beta_2$, $X_t$, and the $\varepsilon_t$ generated
above, the dependent variable, $Y_t$, was generated as follows:

$$Y_t = \beta_1 + \beta_2 X_t + \varepsilon_t. \tag{4.6}$$


An OLS regression was then performed with direct labor
personnel as the independent variable and $Y_t$ as the dependent
variable. The residuals from this regression were then used
to compute the nine estimators of $\rho_4$. As in Park and
Mitchell's 1979 analysis any estimator that equaled or
exceeded 1.0 was reset to .99999. For the two iterative
algorithms, the iterative Prais-Winsten and the maximum
likelihood, if two consecutive estimates exceeded 1.0 they
were both reset to .99999 and convergence was declared. The
results for these two estimators are only for the cases when
convergence was attained.
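
The four generation steps and the initial OLS fit can be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the function names, the personnel series `x`, and the parameter values in the usage lines are hypothetical, and the thesis's own programs (in the appendix) are not reproduced here.

```python
import math
import random


def generate_sar4_errors(T, rho4, sigma_v, rng):
    """Steps (1)-(3): return T errors with e_t = rho4 * e_{t-4} + v_t."""
    n = T + 8
    v = [rng.gauss(0.0, sigma_v) for _ in range(n)]               # step (1)
    eps = [v[t] / math.sqrt(1.0 - rho4 ** 2) for t in range(4)]   # step (2)
    for t in range(4, n):                                         # step (3)
        eps.append(rho4 * eps[t - 4] + v[t])
    return eps[8:]  # drop the first eight start-up values


def generate_y(x, beta1, beta2, eps):
    """Step (4): Y_t = beta1 + beta2 * X_t + eps_t."""
    return [beta1 + beta2 * xt + e for xt, e in zip(x, eps)]


def ols(x, y):
    """OLS of y on x with an intercept; returns (intercept, slope, residuals)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    residuals = [b - intercept - slope * a for a, b in zip(x, y)]
    return intercept, slope, residuals


rng = random.Random(0)                       # fixed seed for repeatability
x = [1000.0 + 25.0 * t for t in range(24)]   # hypothetical personnel series
eps = generate_sar4_errors(T=24, rho4=0.6, sigma_v=500.0, rng=rng)
y = generate_y(x, beta1=0.0, beta2=12.0, eps=eps)
intercept, slope, residuals = ols(x, y)
```

The residuals returned by `ols` are the quantities from which each estimator of $\rho_4$ would then be computed; repeating the last four lines with fresh draws gives one replication per pass.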




E. MEASURES OF EFFECTIVENESS

Three MOEs were used to determine the relative performances
of the estimators: the RMSE of $\hat{\rho}_4$, the RMSE of the
slope coefficient, $\hat{\beta}_2$, and the adjusted R-squared value.
Each MOE was computed and averaged over the one hundred
replications for each value of $\rho_4$.

To evaluate how accurately each estimator predicted $\rho_4$
and its variance the RMSE of $\hat{\rho}_4$ was computed (Rao and
Griliches, 1969). The equation is

$$\mathrm{RMSE}(\hat{\rho}_4) = \left[ \sum_{r=1}^{100} (\hat{\rho}_{4r} - \rho_4)^2 / 100 \right]^{1/2}. \tag{4.7}$$


The RMSE of $\hat{\beta}_2$ evaluated the estimators in terms of
their performance for the slope coefficient, $\beta_2$. To make
comparisons easier, the performance of each estimator (EGLS
solution) relative to that of the OLS solution was computed.
As presented in Park and Mitchell (1980) the relative
efficiency is

$$\mathrm{Eff}(\hat{\beta}_2, \text{estimator } i) = \frac{\mathrm{RMSE}(\hat{\beta}_2, \mathrm{OLS})}{\mathrm{RMSE}(\hat{\beta}_2, \mathrm{EGLS}_i)} \tag{4.8}$$

where

$$\mathrm{RMSE}(\hat{\beta}_2) = \left[ \sum_{r=1}^{100} (\hat{\beta}_{2r} - \beta_2)^2 / 100 \right]^{1/2}. \tag{4.9}$$


The last MOE utilized was the adjusted R-squared value
from each EGLS regression. Though three MOEs were
utilized, the RMSE of $\hat{\rho}_4$ was considered the most important.
The other two were considered only if the RMSE of $\hat{\rho}_4$ was
sufficiently close for alternative estimators.
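
The RMSE and relative-efficiency measures in equations (4.7) through (4.9) can be computed with a few lines of Python. The sketch below is illustrative only; the function names are hypothetical and the input lists stand for the estimates collected over the replications.

```python
import math


def rmse(estimates, true_value):
    """Root mean squared error of a list of estimates about the true value."""
    n = len(estimates)
    return math.sqrt(sum((e - true_value) ** 2 for e in estimates) / n)


def relative_efficiency(ols_estimates, egls_estimates, true_value):
    """RMSE of the OLS slope estimates divided by that of the EGLS estimates.

    Values above 1 mean the EGLS solution was more efficient than OLS.
    """
    return rmse(ols_estimates, true_value) / rmse(egls_estimates, true_value)
```

For example, with a true slope of 2, OLS estimates of 0 and 4, and EGLS estimates of 1 and 3, `relative_efficiency([0, 4], [1, 3], 2)` gives 2.0.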
F. RESULTS
1. General

The results for the three simulations are summarized
in Tables 2 through 4 and Figures 4.1a through 4.3c. The
three different simulation runs are identified by the
contractor label. Since each simulation was run with a
different independent variable, each having its own unique
structure, the results vary slightly between simulations.

The two iterative procedures had slight convergence
problems at low values of $\rho_4$ ($\rho_4 \le .3$). This could possibly
have occurred because at low values of $\rho_4$ the error terms,
$\varepsilon_t$, do not have a significant SAR(4) process structure.
The first estimate could then be a poor one and the
iterative procedure could proceed in the wrong direction
(most likely toward negative values of $\rho_4$). The convergence
problem decreased as $\rho_4$ increased, such that by $\rho_4 = .7$
convergence occurred almost 100 percent of the time. For both
estimators convergence generally occurred in four to six
iterations.

2. Estimation of Rho

The RMSE of $\hat{\rho}_4$ was used to determine which estimators
provided the best estimate of $\rho_4$. As can be seen in
Table 2 and Figures 4.1a through 4.1c, no estimator was
uniformly the best. Estimators three and four, the two
estimators that utilized the Wallis test statistic,
appeared to be vastly superior over the entire range of $\rho_4$.
They were only outperformed at the extreme low end by
estimators one and two, the two versions of the sample
autocorrelation coefficient. Estimator three was the best in the
range $.2 \le \rho_4 \le .5$. A crossover occurred at $\rho_4$ equal to .6 and
estimator four dominated for the upper range of $\rho_4$. An
exception to this was for the contractor A simulation where
estimator nine was the best for part of the range of $\rho_4$. Estimator nine, the
maximum likelihood estimator, appeared to be the third most
efficient, with the iterative Prais-Winsten estimator a
close fourth. It was interesting to compare the performance
of the standard Prais-Winsten estimator, estimator one, with
that of Park and Mitchell's version, estimator seven.
Estimator one was better in the lower range ($\rho_4 < .5$) with
estimator seven better from that point on. Overall though,
it appeared that estimator seven was the better of the two.
It could also be observed that all estimators except one and
two improved as $\rho_4$ increased. Estimators one and two
performed better at the lower end of $\rho_4$. Recall that
these two estimators are known to be downwardly biased in
their AR(1) process forms.
3. Estimation of Slope Coefficient 

Most noteworthy was the fact that all estimators
provided more efficient estimation than did the OLS solution
for $\rho_4 > .2$. As can be seen in Table 3 and Figures 4.2a
through 4.2c, none of the estimators dominated over any
significant range of $\rho_4$. The two best performers appeared
to be estimators eight and nine. These two estimators
possessed a slight edge over estimators three and four. The
performances of the remaining estimators, except for
estimators one and two which were clearly inferior, were very
comparable and no significant distinctions could be drawn.

4. EGLS Regression Quality 

The adjusted R-squared value was selected as the MOE
to evaluate the quality of the EGLS regression. Again, as is
shown in Table 4, all estimators provided better results
than did the OLS solution. As can also be seen in Figures
4.3a through 4.3c estimators three and four were again
vastly superior. This time estimator four performed the best
until it was surpassed by estimator three at approximately
$\rho_4$ equal to .8. Estimator nine again appeared to be the third
best performer. Except for estimators one and two, which
were again slightly inferior, the remaining estimators were
all relatively comparable in their overall performance.
Estimators one and two were the best at the extreme high end
($\rho_4 = .95$) in two of the simulations, however. In comparing the
two Prais-Winsten estimators, estimator seven performed
better for $\rho_4 < .7$, with estimator one better for $\rho_4 > .7$.
Again estimator seven appeared to be the better of the two
estimators.

TABLE 3
RELATIVE EFFICIENCIES FOR THE THREE SIMULATION RUNS

TABLE 4
ADJUSTED R-SQUARED FOR THE THREE SIMULATION RUNS

5. Summary

All of these estimators provided an improvement over
the OLS solution when the SAR(4) process was present in a
significant amount ($\rho_4 > .2$). This mirrors the results of the
previous Monte Carlo analyses for the AR(1) process cited
earlier in the chapter. This simulation indicated that
estimators three and four were the best estimators of $\rho_4$.
Different results may be obtained for differently structured
independent variables. The performances of estimators three
and four were nearly identical. The selection of one over
the other may be based upon the smoothness of the
explanatory variable or the approximate value or range of $\rho_4$, if
known. It was expected beforehand that the maximum likelihood
and iterative Prais-Winsten estimators, the two most
time costly estimators, would outperform all of the
other estimators. They in fact finished third and fourth,
with estimator nine proving to be slightly better than
estimator eight. It is worth noting again that none of the
previous AR(1) process studies compared the AR(1) process
versions of estimators three and four. Therefore it was
believed that, like all of the other two-stage estimators,
they would have been outperformed by the maximum likelihood
and iterative Prais-Winsten estimators, as was shown to be
the case in the previous studies. The result that estimator
nine was better than estimator eight was the opposite of
that obtained in the previous AR(1) process studies, but in
both cases their performances were very close. As in the
previous studies it was shown that iteration leads to a
more efficient estimator, as the iterative Prais-Winsten
estimator was again shown to be better than its two-stage
counterparts. Of the two Prais-Winsten estimators,
estimator seven (Park and Mitchell's version) appeared to be
better overall. This agreed with Park and Mitchell's
findings in their analysis of first-order autocorrelation
estimators. Knowledge of the approximate range of $\rho_4$ beforehand
would help in selecting the more appropriate of these two
estimators. In the lower region estimator one was better,
while estimator seven dominated in the upper region. The
next chapter presents and compares the final regression
models obtained for the three estimators, numbers three,
four, and nine.
V. STRUCTURAL ANALYSIS


A. GENERAL

In this chapter the procedures outlined in chapter three
were followed to obtain EGLS regression models for each
contractor. A separate model was obtained using each of the
three preferred estimators from the previous chapter,
estimators three, four, and nine. All of the models are
presented for comparison. Due to the large number of models
obtained the entire procedure is illustrated in detail for
only one, contractor A's EGLS model for estimator three.
Only the final results of the other models are presented.
In all cases direct labor personnel was utilized as the
explanatory variable and total overhead costs as the
dependent variable. The computer programs utilized in the
structural analysis are contained in the appendix.


B. PROCEDURE

Table 5 presents the results of these procedures applied
to the regression of total overhead costs for contractor A
(TOTOHA) on direct labor personnel for contractor A
(DIRPERA). The results of this initial regression were
very poor. The adjusted R-squared value was very
low and the F-statistic (not including constant term) was
very close to its five percent critical value of 4.32 even
though both were inflated due to the presence of
autocorrelation. The low R-squared value indicated that the
regression equation explained little beyond the mean of the
dependent variable (Boger, 1983, p.21). Though biased
downward, also due to the presence of autocorrelation, the
standard errors of the regression coefficients were still
large relative to the magnitude of the coefficients.
TABLE 5
RESULTS FOR CONTRACTOR A

Model: TOTOHA = a + b DIRPERA

[Table body not recoverable from the source. Untransformed data: standard error of the regression, adjusted R-squared, F-statistic (degrees of freedom), estimates and standard errors of a and b, Durbin-Watson and Wallis test statistics, and estimators three, four, and nine. Transformed data: standard error of regression, adjusted R-squared, F-statistic, and estimates and standard errors of a and b for each of estimators three, four, and nine.]


The autocorrelation function of the residuals obtained
from this regression (Figure 5.1), with its single significant
spike at lag four, strongly suggested the presence of
Wallis's SAR(4) process. Upon formally testing the residuals
for the presence of this SAR(4) process, the null hypothesis
that no fourth-order autocorrelation was present was clearly
rejected. At this point the Durbin-Watson statistic was
insignificant.
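
For readers who want to reproduce this kind of residual diagnosis, the Python sketch below computes a sample autocorrelation at a given lag and the ratio of squared lagged differences to squared residuals. With a lag of one this ratio is the usual Durbin-Watson statistic, and with a lag of four it is the fourth-order analogue proposed by Wallis. The function names are hypothetical, the example residuals are made up, and critical values still have to be looked up in the published tables.

```python
def autocorrelation(e, lag):
    """Sample autocorrelation of the series e at the given lag."""
    mean = sum(e) / len(e)
    d = [x - mean for x in e]
    return sum(d[t] * d[t - lag] for t in range(lag, len(d))) / sum(x * x for x in d)


def d_statistic(e, lag):
    """Sum of squared lag-differences over the sum of squares.

    lag=1 gives the Durbin-Watson statistic; lag=4 gives the
    fourth-order statistic of Wallis. Values near 2 suggest no
    autocorrelation at that lag; values near 0 suggest strong
    positive autocorrelation.
    """
    num = sum((e[t] - e[t - lag]) ** 2 for t in range(lag, len(e)))
    return num / sum(x * x for x in e)


# Made-up residuals with a perfectly repeating quarterly pattern.
e = [1.0, -1.0, 2.0, -2.0] * 6
r4 = autocorrelation(e, 4)   # large positive spike at lag four
d4 = d_statistic(e, 4)       # zero, since e_t equals e_{t-4} exactly
d1 = d_statistic(e, 1)       # well above zero
```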
Figure 5.1 Autocorrelation Function of the Residuals
for Contractor A.


As can be observed, the three calculated estimators of
$\rho_4$ were relatively close. The data were then transformed and
the model reestimated. This step was performed three times
in order to obtain a reestimated model for each of the
estimators of $\rho_4$. The residuals of the new model were then
analyzed for the presence of autocorrelation. In all three
cases the Durbin-Watson statistic proved to be significant,
indicating the presence of first-order autocorrelation. The
data were again transformed, this time to eliminate the
AR(1) process (using the calculated estimator of $\rho_1$), and
the model was reestimated. Examination of the residuals from
this model resulted in the finding that both the
Durbin-Watson and Wallis test statistics were insignificant,
indicating that neither the AR(1) nor SAR(4) processes were
present in the residuals. The autocorrelation function of
these residuals (Figure 5.2) also showed that no
autoregressive process was present at any level.
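
The two successive transformations can be pictured as quasi-differencing, first at lag four and then at lag one. The Python sketch below is a simplified illustration and not the procedure of chapter three: it drops the initial observations at each stage, whereas Prais-Winsten-type procedures retain them with a rescaling. The function name, the series, and the values of the autocorrelation parameters are all hypothetical.

```python
def quasi_difference(series, rho, lag):
    """Return series[t] - rho * series[t - lag] for every t >= lag.

    The first `lag` observations are dropped (a simplifying assumption).
    """
    return [series[t] - rho * series[t - lag] for t in range(lag, len(series))]


y = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]   # hypothetical dependent variable
rho4_hat, rho1_hat = 0.5, 0.2        # hypothetical estimates

y_sar4 = quasi_difference(y, rho4_hat, lag=4)       # removes the SAR(4) process
y_both = quasi_difference(y_sar4, rho1_hat, lag=1)  # then removes the AR(1) process
```

The explanatory variable is transformed in the same way. If the original model is $Y_t = a + bX_t + \varepsilon_t$, a regression on data quasi-differenced with parameter $\rho$ has intercept $a(1-\rho)$, so the original intercept is recovered by dividing by $(1-\rho)$.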
Figure 5.2 Autocorrelation Function of the Residuals
for Contractor A's model (estimator 3).


The resulting model now had residuals that were free of
autocorrelation. The residuals were then analyzed to determine
if all of the assumptions required for the regression
were satisfied. Note that this does not mean that it has
been concluded that the assumptions are all necessarily
correct. It merely means that "on the basis of the data we
have seen, we have no reason to say that they are incorrect"
(Draper and Smith, 1981, p.142).

Three tests were performed to check the normality of the
residuals. An empirical cumulative distribution function
(CDF) was generated to compare the CDF of the residuals with
that of the appropriate distribution. In this case the
appropriate distribution was normal with mean zero and
standard deviation 9099. A probability (Q-Q) plot of the
residuals was also generated which plotted the quantiles of
the residuals against the corresponding quantiles of the
appropriate distribution. Each of these plots (Figure 5.3)
was bounded by the ninety-five percent Kolmogorov-Smirnov
(K-S) confidence boundaries. Both of these plots lie
completely within the K-S boundaries, supporting the
assumption that the residuals were distributed normally with mean
zero and standard deviation 9099. To more formally test this
hypothesis a Kolmogorov-Smirnov goodness-of-fit test was
performed with null hypothesis $H_0$: $F(x) = F^*(x)$, versus $H_1$:
$F(x) \neq F^*(x)$, where $F^*(x)$ is the normal distribution with zero
mean and standard deviation 9099 and $F(x)$ is the unknown
distribution function of the data. As presented in Conover
(1980, p.347) the test statistic for the K-S test is simply
the greatest distance between the CDF, $F^*(x)$, and the empirical
CDF of the data. It is possible to obtain this test
statistic directly from the normal CDF plot in Figure 5.3.
The test was performed using a significance level of
$\alpha = .05$. The K-S test statistic had a significance level
(p-value) of .9693, far above .05. Therefore, the null
hypothesis could not be rejected.
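
The K-S statistic described above, the greatest vertical distance between the hypothesized normal CDF and the empirical CDF, can be computed directly. The Python sketch below uses only the standard library; the function names are hypothetical, and converting the statistic to a significance level still requires the K-S tables or a statistics package.

```python
import math


def normal_cdf(x, mean, sd):
    """CDF of the normal distribution with the given mean and standard deviation."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))


def ks_statistic(sample, mean, sd):
    """Largest distance between the empirical CDF and the normal CDF.

    The empirical CDF jumps at each observation, so the distance is
    checked just before and just after every jump.
    """
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs):
        f = normal_cdf(x, mean, sd)
        d = max(d, (i + 1) / n - f, f - i / n)
    return d
```

For the residuals in the text the call would take the form `ks_statistic(residuals, 0.0, 9099.0)`, with `residuals` standing for the final model's residuals.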

Figure 5.3 Tests for Normality of Residuals
for Contractor A's model (estimator 3).

Figure 5.4 Tests for Homogeneity of Variance of the Residuals
for Contractor A's model (estimator 3).

Two tests were performed to test the constant variance
(homogeneity of variance) assumption. The residuals were
plotted against the predicted dependent variable (Figure
5.4) to see if any obvious abnormalities could be observed.
The dispersion of the residuals appears to be fairly random
and the variance relatively constant throughout. An F test
was then performed. The residuals were divided into two sets
and the null hypothesis $H_0$: $\sigma_1^2 = \sigma_2^2$ was tested against $H_1$:
$\sigma_1^2 \neq \sigma_2^2$, where $\sigma_1^2$ and $\sigma_2^2$ were the variances of the two
sets of residuals. As presented in Mood, et al. (1974,
p.438) the test statistic for this F-test is

$$F = \frac{(n_2 - 1) \sum (X_{1i} - \bar{X}_1)^2}{(n_1 - 1) \sum (X_{2i} - \bar{X}_2)^2},$$

where $\bar{X}_1$ and $\bar{X}_2$ are the sample means of the two sets. This
test statistic has the F distribution with $n_1 - 1$ and $n_2 - 1$
degrees of freedom when $\sigma_1^2 = \sigma_2^2$. This test was also
performed using a significance level of $\alpha = .05$. The
test statistic had a significance level (p-value) of .5154.
Therefore the constant variance assumption could not be
rejected.

These tests could not reject the assumption that the
residuals were in fact normally distributed random variables
with zero mean and constant variance.
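
The variance-ratio statistic can be sketched in Python as below. The text does not say how the residuals were divided into two sets, so any split (for example, first half against second half) is an assumption of the user; the function name is hypothetical, and the significance level must be taken from F tables or a statistics package.

```python
def variance_ratio(first, second):
    """Ratio of the two sample variances and its degrees of freedom.

    Returns (F, (n1 - 1, n2 - 1)).
    """
    def sample_variance(xs):
        mean = sum(xs) / len(xs)
        return sum((x - mean) ** 2 for x in xs) / (len(xs) - 1)

    f = sample_variance(first) / sample_variance(second)
    return f, (len(first) - 1, len(second) - 1)
```

As a small check, `variance_ratio([1, 2, 3], [2, 4, 6])` returns a ratio of 0.25 with (2, 2) degrees of freedom, since the second set has four times the variance of the first.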

The results for this regression indicated that the
model contained a great deal of information on overhead
costs. The R-squared value was high, indicating
that the model contained much more information than just the
mean of the dependent variable. The standard errors of the
regression coefficients, especially that of the slope term,
were now relatively small in comparison to the coefficients.
In summary, after transforming the data numerous times to
eliminate all autocorrelation from the residuals, an EGLS
regression model was obtained that yielded excellent,
reliable results. This model for the total overhead costs for
contractor A was

TOTOHA = 69090 + 12.18 DIRPERA.

Since all costs were measured in thousands of dollars, the
model may be interpreted as indicating that there is a fixed
cost component of approximately $69,090,000 to overhead
costs (when modeled as a function of direct labor personnel) with an
additional $12,180 per direct labor person added to total
overhead costs (Boger, 1983, pp.24-25).
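
To make the interpretation concrete, the Python sketch below evaluates the fitted line using the fixed and per-person components stated in the text, with costs in thousands of dollars. The function name and the staffing level of 1,000 are hypothetical and chosen only for illustration; the model is valid only within the range of personnel levels observed in the data.

```python
def predicted_overhead_thousands(personnel, intercept=69090.0, slope=12.18):
    """Predicted total overhead, in thousands of dollars, for contractor A."""
    return intercept + slope * personnel


# Hypothetical staffing level of 1,000 direct labor personnel.
estimate = predicted_overhead_thousands(1000)
```

Here `estimate` is 69090 plus 12.18 times 1000, about 81,270 thousand dollars, or roughly $81.3 million.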

The results of the final EGLS models for contractor A
using estimators four and nine are also contained in Table
5. All required assumptions about the residuals in these
models also appeared to be valid. All of the models yielded
excellent results that were very comparable. The slope
terms of all of the models were within eight one-hundredths,
a spread of approximately three one-hundredths of one
standard deviation. The model for estimator three was slightly
superior, however.
This same general procedure was carried out for all of
the remaining models, but only their final results are
reported. The only difference in the procedure for the
different models was in the order that the various
autoregressive processes were removed from the residuals. The
normal sequence was to remove the AR processes by order of
significance. To get an initial overview of the type of
autocorrelation that was present in the residuals of the
initial models, the autocorrelation functions of the
residuals (Figures 5.5a and 5.5b) were examined. This, combined
with the results of the Durbin-Watson and Wallis tests, gave
the AR process that the data were to be adjusted for first.
The remaining order was then determined from the results of
the Durbin-Watson and Wallis tests performed on the
residuals of the previous model.

The autocorrelation functions showed that only the
residuals for contractor A clearly appeared to possess the
pattern expected to be exhibited by residuals containing
Wallis's SAR(4) process. It was also clearly visible that
the models for contractors C and D did not possess this
SAR(4) process at a significant level. As with all of the
contractors they will be examined more thoroughly later in
this chapter. Despite the uncharacteristic appearance of
their autocorrelation functions it appeared that Wallis's
SAR(4) process was also the most significant form of
autocorrelation present in the initial models for the
remaining contractors (B, E, and F).

A number of factors led to this conclusion. First, the
amount of data was very small relative to the amount
required to obtain an accurate portrayal of the
autoregressive process from the autocorrelation function. It should be
noted that in the cases of contractors A, B, E and F the
spike for the lagged four residuals was generally most
significant. Secondly, the data were quarterly and, as was
mentioned in chapter three, the plots of their dependent
variables did appear to exhibit the seasonal pattern
described by Wallis. Last, as will be shown, the Wallis test
statistic was significant for the residuals from each of
their initial models.

Figure 5.5a Autocorrelation Functions of the Residuals.

Figure 5.5b Autocorrelation Functions of the Residuals.


C. THE REMAINING MODELS
1. Contractor B

Table 6 contains the results for contractor B. The
results of the initial regression of the untransformed data
were fairly good. All of the statistical results, adjusted
R-squared, standard error of regression, F-statistic, and
standard errors of the regression coefficients, supported
this conclusion. But these statistical results could have
been misleading due to the presence of autocorrelation.
Analysis of the residuals showed the Wallis test statistic
to be significant, indicating the presence of the SAR(4) process,
while the Durbin-Watson test statistic was shown to be
insignificant. The data were then transformed and the model
reestimated for each of the three estimators of $\rho_4$. Again
all of the estimators were fairly close. These new models
all provided excellent results. Analysis of the residuals
showed no presence of any AR process, and none of the
necessary assumptions pertaining to the residuals were violated
in any of the models. Again the performance of all three
models was very comparable, but the one obtained using
estimator nine was slightly superior. It should be noted
that all three models have negative intercepts. This implies
negative fixed costs for those models. Though implausible,
it is not totally infeasible. The models are being fit to
data that are far from the Y-axis and, as with all models,
these are only valid within the relevant range defined by
the chosen explanatory variable.
TABLE 6
RESULTS FOR CONTRACTOR B

Model: TOTOHB = a + b DIRPERB

[Table body not recoverable from the source. Untransformed data: standard error of the regression, adjusted R-squared, F-statistic (degrees of freedom), estimates and standard errors of a and b, Durbin-Watson and Wallis test statistics, and estimators three, four, and nine. Transformed data: standard error of regression, adjusted R-squared, F-statistic, and estimates and standard errors of a and b for each of estimators three, four, and nine.]
2. Contractor C
The results for contractor C are contained in Table
7. Very poor results were obtained for the initial
regression. From the autocorrelation function of the residuals of
this model (Figure 5.5a) it appeared that the AR(1) process
was the most significant form of autocorrelation present.
The Durbin-Watson test statistic supported this finding. It
was significant, indicating the presence of first-order
autocorrelation. As was expected, the Wallis test statistic
was insignificant. The data were then transformed to
eliminate the presence of the AR(1) process, and the model
reestimated. Examination of the residuals of this model
showed no autocorrelation present nor any required
assumptions violated. The final model had been obtained without
requiring a data transformation for the SAR(4) process. The
results of this model, though inferior to the previous two,
were fairly good.
3. Contractor D
Table 8 presents the results for contractor D. Again
fairly poor results were obtained for the initial
regression. From the autocorrelation function of the residuals of
this model (Figure 5.5b) it appeared that no AR process was
present in any significant amount. As expected, the
Durbin-Watson and Wallis test statistics were both
insignificant. So the final model, though rather poor, had been
obtained without requiring any data transformations. Further
analysis of the residuals indicated that again all necessary
assumptions appeared to hold. The model for contractor D's
total overhead costs is probably unreliable.
4. Contractor E
The results for contractor E are presented in Table
9. Again very poor results were obtained for the initial
model. The sequence of steps for contractor E was the same
as that for contractor A. First, it was necessary for the
data to be transformed to eliminate the presence of the
SAR(4) process, and then the model was reestimated. As
usual, a reestimated model was calculated for each of the
three estimators of $\rho_4$. Estimators three and four were again
very close while estimator nine was significantly larger.
Next it was necessary (in all three cases) for the data to
be transformed to eliminate the AR(1) process from the
residuals. The EGLS regression of this transformed data proved
to be the final reestimation required. Analysis of these
residuals showed no presence of autocorrelation nor
violation of any required assumptions. All three final models
provided fairly good results and were again very comparable.
The model for estimator nine was again slightly better,
however.

TABLE 7
RESULTS FOR CONTRACTOR C

Model: TOTOHC = a + b DIRPERC

Untransformed Data
    Estimate of a: 158100.
        Standard Error: 12440.
    Estimator of $\rho_1$: .62009
    [Remaining untransformed-data entries and all transformed-data entries are not recoverable from the source.]

TABLE 8
RESULTS FOR CONTRACTOR D

Model: TOTOHD = a + b DIRPERD

Untransformed Data
    [Entries not recoverable from the source: standard error of the regression, adjusted R-squared, F-statistic (degrees of freedom), estimates and standard errors of a and b, and Durbin-Watson and Wallis test statistics.]

5. Contractor F

Table 10 presents the results for contractor F. This
time the results for the OLS model were fairly good. But
again, these statistical results could have been misleading
due to the presence of autocorrelation. As with contractor E
the data had to be transformed for the SAR(4) and then the
AR(1) process, and the model reestimated, before a final,
"uncorrelated" model was obtained. Examination of these
residuals showed that no type of autocorrelation was present
and that all required assumptions appeared to hold. The
results from all three final models were again very
comparable, and were very good. The model for
estimator three was slightly superior, however.


D. SUMMARY 
Two things were accomplished in this chapter. First,
regression models were obtained for each contractor that
allowed for comparison of their total overhead costs.
Second, since final models were calculated for each of the
three estimators of $\rho_4$, another means to compare the
estimators was obtained.

TABLE 9
RESULTS FOR CONTRACTOR E

Model: TOTOHE = a + b DIRPERE

Untransformed Data
    [Entries not recoverable from the source.]

Transformed Data

Estimator Three
    Estimate of a: 19450.
        Standard Error: 12880.
    Estimate of b: 8.802

Estimator Four
    F-statistic: 83.39
    Estimate of a: 19240.
        Standard Error: 12790.
    Estimate of b: 8.864

Estimator Nine
    F-statistic: 86.15
    Estimate of a: 18580.
    Estimate of b: 9.08

    [Entries not listed above are illegible in the source.]

TABLE 10
RESULTS FOR CONTRACTOR F

Model: TOTOHF = a + b DIRPERF

[Table body not recoverable from the source. Untransformed data: standard error of the regression, adjusted R-squared, F-statistic (degrees of freedom), estimates and standard errors of a and b, Durbin-Watson and Wallis test statistics, and estimators three, four, and nine. Transformed data: standard error of regression, adjusted R-squared, F-statistic, and estimates and standard errors of a and b for each of estimators three, four, and nine.]

The analysis indicated that excellent structural
results were obtained for all but one contractor, contractor
D. All of the others yielded reliable, useful regression
models. For comparative purposes the model that provided
the best results for each contractor is presented:


TOTOHA = 69090 + 12.18 DIRPERA
TOTOHB = -100500 + 24.68 DIRPERB 
TOTOHC = 149700 + 2.699 DIRPERC 
TOTOHD = S2175 14233 DERPERD 
TOTOHE = 18580 + 9.08 DIRPERE


TOGRGHE = 6229700 + 97175 DIRPERF. 


The model for contractor D was included in the comparison 
even though its results were unreliable. 

It is possible to use these models to compare the
overhead cost structures of the firms in the sample. The model
for contractor E lies everywhere below the regressions for
contractors A and F. In each case contractor E had both a
significantly lower fixed cost and a lower (not significant)
variable cost than the other contractor. Likewise contractor
C's model was uniformly below that of contractor F. In this
case though, the differences in the fixed and variable costs
were both statistically significant. It should be noted
that the comparisons imply only that, with the same number
of direct labor personnel, one contractor experiences lower total
overhead costs than another. They do not imply that the
contractor possesses lower total overhead costs regardless
of the circumstances (Boger, 1983, p.33).

For all relevant contractors the models for the three
estimators were very comparable. They were so comparable,
in fact, that no distinction could be drawn as to which
estimator was better. The superiority of estimators three
and four over estimator nine in the Monte Carlo simulation
was evidently not large enough to manifest itself in
this structural analysis.
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VI. CONCLUSIONS 


The objectives of this thesis were twofold. The first
objective was to determine the best estimators of the
autocorrelation found in the residuals. The second was to
obtain simple and efficient regression models for overhead
costs for the six aerospace contractors. The first objective
was simply a means of achieving the second, since the quality
of the EGLS regression is dependent upon the quality of the
estimator used.

Two methods were utilized to determine the best
estimators of fourth-order autocorrelation: a Monte Carlo
simulation and a comparison of the EGLS regression models of
the three preferred estimators (from the simulation). The
results from the Monte Carlo simulation indicated estimators
three and four, the two estimators that utilize the Wallis
test statistic, to be the two best estimators of fourth-order
autocorrelation. Their overall performances were
nearly identical, with preference determined by the value of
ρ₄: estimator three was superior to estimator four for ρ₄
less than or equal to .5, while four was superior for ρ₄
greater than .5. The maximum likelihood estimator, estimator
nine, was shown to be the third best estimator.
Another important result from the simulation was that all
nine of the estimators provided an improvement over the OLS
solution when the SAR(4) process was present. Comparison of
the respective EGLS models showed no difference between the
performances of the three estimators. The models were so
comparable that no distinction could be drawn between the
estimators.

Due to their superior performances in the Monte Carlo
simulation, estimators three and four were chosen as the
best estimators of fourth-order autocorrelation. However,
it may have appeared from the structural analysis that
estimator nine was their equal. The Monte Carlo simulation was
much more sensitive to slight differences in the performances
of the estimators than the regression models were. For
this reason the simulation was the main criterion used in
selecting the best estimator. Comparison of the regression
models in the structural analysis would have been more
useful if distinct disparities had arisen between the
performances of the estimators.

From this analysis no distinct preference could be made
between estimators three and four. Both estimators are
fairly easy to calculate once the Wallis test statistic has
been computed, so difficulty of computation cannot be used
to determine a preference between the two. The only
criteria found to discriminate between the two are the
amount of fourth-order autocorrelation present in the
residuals (the value of ρ₄) and the "smoothness" of the
explanatory variables. If the value of ρ₄ (if it can be
speculated) is less than or equal to .5, then estimator three
would be expected to perform better, while estimator four
would be preferred for ρ₄ greater than .5. Though not tested
herein, if the explanatory variables are "smooth" then
estimator four would again be expected to outperform estimator
three. Though not as scrupulously investigated in this
thesis, Park and Mitchell's version of the Prais-Winsten
estimator was selected as the estimator of first-order
autocorrelation. It was selected because it was shown to be the
superior two-stage estimator for first-order autocorrelation
in Park and Mitchell's 1980 study.
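As a minimal sketch of how estimators three and four follow from the Wallis statistic, the Python below mirrors the arithmetic of the appendix function LAG: d4 is the sum of squared fourth differences of the residuals divided by their sum of squares, estimator three is 1 - d4/2, and estimator four applies the Theil-Nagar style modification with the constant 4 used in that listing. The function names and the sample residuals are illustrative assumptions and do not come from the thesis.

```python
def wallis_d4(residuals):
    """Wallis fourth-order statistic: sum((e[t]-e[t-4])^2) / sum(e[t]^2)."""
    num = sum((residuals[t] - residuals[t - 4]) ** 2
              for t in range(4, len(residuals)))
    den = sum(e * e for e in residuals)
    return num / den


def estimator_three(residuals):
    """Estimator three: 1 - d4/2."""
    return 1.0 - 0.5 * wallis_d4(residuals)


def estimator_four(residuals):
    """Estimator four: (n^2 * estimator_three + 4) / (n^2 - 4)."""
    n = len(residuals)
    return (n * n * estimator_three(residuals) + 4.0) / (n * n - 4.0)


# Hypothetical residuals that repeat exactly every four periods.
resid = [1.0, -1.0, 2.0, -2.0] * 6
print(estimator_three(resid), estimator_four(resid))
```

With a perfectly repeating four-period pattern, d4 is zero and estimator four comes out slightly above 1, which suggests why the transformation routine in the appendix resets any estimate that is not below 1 to .99999.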

The structural analysis showed that excellent results
could be obtained for four of the six aerospace contractors.
The results of a fifth model were also more than adequate.
These excellent results were only obtainable after the
effects of autocorrelation were transformed out of the
dependent and independent variables. As was the case in
Boger's 1984 analysis, the five superior structural models
should provide excellent forecasting results. It can
ultimately be concluded that, after eliminating the effects of
autocorrelation through transformation, a simple, efficient
model can be obtained to directly estimate overhead costs
for five of these six aerospace contractors.
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APPENDIX 
COMPUTER PROGRAMS 


This appendix contains listings of the programs utilized 
in the analysis performed herein. All of the functions are 
written in APL and contain documentation. The programs 
utilized by the Monte Carlo simulation in Chapter 4 were 
SIM, RREGRESS, LAGS, DURBIN, CALL1, and TRANS. The following 
is a general description of what these programs do and the 
sequence of steps followed in the Monte Carlo simulation. 

First, the function SIM generates the data required by
the simulation. Next, an OLS regression of the generated
dependent variable, Y, on the independent variable, X, is
performed. The residuals of this OLS regression are then
sent to the function LAGS, which computes the nine estimators
of ρ₄ and the MSE of ρ₄. Next, the function TRANS transforms
the data using the appropriate estimator. An EGLS regression
is then performed on this transformed data. These last two
steps are actually performed in the function CALL1, which
calls the functions TRANS and RREGRESS in succession. The
function LAGS then takes the results of this EGLS regression
and computes the MSE of β and the adjusted R-squared value.
This cycle is replicated one hundred times for each value of
ρ₄. The function RREGRESS is called to perform all of the
regressions required in the simulation. The functions
DURBIN, PWIT, and MAX are called to compute the Durbin,
iterative Prais-Winsten, and maximum likelihood estimators of
fourth-order autocorrelation.
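The data-generation step of the cycle just described can be sketched in Python as follows. This is only an illustration under stated assumptions: every disturbance starts as a shock divided by sqrt(1 - ρ₄²), all but the first four are then overwritten by the SAR(4) recursion, and an eight-observation start-up run is discarded, which matches 32 draws being reduced to 24 in the SIM listing. The names sar4_errors and simulate_y and the parameters sigma and rng are hypothetical.

```python
import math
import random


def sar4_errors(u, rho4):
    """Build SAR(4) disturbances e[t] = rho4*e[t-4] + u[t] from shocks u."""
    scale = math.sqrt(1.0 - rho4 ** 2)
    e = [shock / scale for shock in u]      # start-up value for every t
    for t in range(4, len(u)):              # overwrite all but the first four
        e[t] = rho4 * e[t - 4] + u[t]
    return e


def simulate_y(x, intercept, slope, rho4, sigma, rng):
    """One replication: y = intercept + slope*x + SAR(4) disturbance."""
    burn_in = 8                              # assumed start-up draws to discard
    u = [rng.gauss(0.0, sigma) for _ in range(len(x) + burn_in)]
    e = sar4_errors(u, rho4)[burn_in:]
    return [intercept + slope * xi + ei for xi, ei in zip(x, e)]


# Hypothetical usage: 24 observations of one regressor, rho4 = 0.5.
x = [float(i) for i in range(1, 25)]
y = simulate_y(x, 0.0, 17.0, 0.5, 100.0, random.Random(0))
```

Passing the random generator in explicitly keeps each replication reproducible, which makes it easier to compare estimators on identical draws.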

The programs utilized in the structural analysis of
Chapter 5 were REGRESS, TRANS, LAG, CHECKER, and VARTEST.
All regressions were performed by REGRESS (Musgrave and
Ramsey, 1981, pp. 254-258). As can be seen from the program
listing, REGRESS outputs numerous statistical results. The
function TRANS was utilized to transform the data for either
an AR(1) or SAR(4) process. The functions LAG and CHECKER
were called to compute numerous estimators and test
statistics (for first- and fourth-order autocorrelation) from the
residuals. The F-test utilized in the analysis was performed
using the function VARTEST.


APL FUNCTION SIM 


RHOS SIM REP
⍝THIS PROGRAM PERFORMS A MONTE CARLO SIMULATION ON
⍝THE NINE ESTIMATORS OF ρ4. THE INPUT RHOS IS A
⍝VECTOR OF ρ4 VALUES THAT YOU WANT TO RUN THE
⍝SIMULATION FOR, IN THIS CASE IT IS A VECTOR OF
⍝LENGTH TEN OF THE VALUES: .1, .2, .3, .4, .5, .6,
⍝.7, .8, .9, AND .95. THE INPUT PARAMETER REP IS
⍝THE NUMBER OF REPLICATIONS OF THE SIMULATION YOU
⍝WISH TO RUN, IN THIS CASE 100.
PRINT←3 9⍴0
OUT←1
RMSERHO←AAR←EFF←RANKRHO←RANKAAR←RANKEFF←((⍴RHOS),9)⍴0
L2:'PERFORMING TEST FOR RHO4 = ',⍕RHOS[OUT]
RHO4←RHOS[OUT]
⍝CREATION OF NECESSARY VECTORS.
MSER4←MSER4S←MSER4L←MSEP4←MSEP4S←MSEPLS←MSED←0
MSEBOLS←MSEBR4←MSEBR4S←MSEBR4L←MSEBP4←MSEBP4S←MSEBPLS←MSEBD←0
AAR2OLS←AAR2R4←AAR2R4S←AAR2R4L←AAR2P4←AAR2P4S←AAR2PLS←AAR2D←0
MSEPW←MSEPPML←MSEBPW←MSEBPPML←AAR2PW←AAR2PPML←REPPW←REPMAX←0
PWITFAIL←0
IN←1
⍝RANDOM GENERATION OF THE U(T). THEY ARE
⍝DISTRIBUTED NORMALLY WITH MEAN 0 AND VARIANCE
⍝ACCORDING TO THE CONTRACTOR.
L1:U←32 NORRAND 0 13454
⍝SET ALL E(T) TO U(T)÷((1-RHO4*2)*0.5).
E←U÷((1-RHO4*2)*0.5)
I←5
⍝SET THE LAST T-4 E(T)S TO (RHO4×E(T-4))+U(T).
IT:E[I]←(RHO4×E[I-4])+U[I]
I←I+1
→IT×⍳I≤32
⍝SET B TO THE APPROPRIATE INTERCEPT AND SLOPE
⍝FOR THE CONTRACTOR BEING TESTED.
B←0 17
XCURRENT←1,XCURRENT
⍝GENERATE THE DEPENDENT VARIABLE, Y
YCURRENT←(XCURRENT+.×B)+ET←¯24↑E
XCURRENT←24 ¯1↑XCURRENT
⍝PERFORM THE OLS REGRESSION OF THE GENERATED
⍝DEPENDENT VARIABLE, Y, ON THE PRESET
⍝INDEPENDENT VARIABLE, X.
YCURRENT RREGRESS XCURRENT
MSEBOLS←MSEBOLS+((BER[2]-B[2])*2)
AAR2OLS←AAR2OLS+ADJR2
⍝CALL THE FUNCTION LAGS WITH THE RESIDUALS FROM
⍝THE OLS REGRESSION AND THE VALUE OF RHO4.
RHO4 LAGS UH
IN←IN+1
→L1×⍳IN≤REP
⍝COMPUTATION OF MOE'S AND OUTPUT
MSERHO←MSER4,MSER4S,MSEP4,MSEP4S,MSED,MSEPLS,MSER4L,MSEPW,MSEPPML
RMSERHO[OUT;]←(MSERHO÷REP)*0.5
AR←AAR2R4,AAR2R4S,AAR2P4,AAR2P4S,AAR2D,AAR2PLS,AAR2R4L,AAR2PW,AAR2PPML
AAR[OUT;]←AR÷REP
MS←MSEBR4,MSEBR4S,MSEBP4,MSEBP4S,MSEBD,MSEBPLS,MSEBR4L,MSEBPW,MSEBPPML
RMSEBETA←(MS÷REP)*0.5
EFF[OUT;]←((MSEBOLS÷REP)*0.5)÷RMSEBETA
PRINT[1;]←RMSERHO[OUT;]
PRINT[2;]←AAR[OUT;]
PRINT[3;]←EFF[OUT;]
RANKRHO[OUT;]←⍋⍋RMSERHO[OUT;]
RANKAAR[OUT;]←⍋⍒AAR[OUT;]
RANKEFF[OUT;]←⍋⍒EFF[OUT;]
'FOR RHO4 = ',⍕RHO4
'RMSE RHO,ADJUSTED RSQR AND EFF:'
' 1 2 3 4 5 6 7 8 9'
PRINT
'AVG ADJ R2 FOR OLS = ',⍕(AAR2OLS÷REP)
'AVG REPS FOR PWIT = ',⍕(REPPW÷REP)
'PWIT FAILED TO CONVERGE = ',⍕PWITFAIL
'AVG REPS FOR MAX = ',⍕(REPMAX÷REP)
' '
OUT←OUT+1
→L2×⍳OUT≤⍴RHOS
'RMSE RHO'
RMSERHO
' '
'RANK RMSERHO'
RANKRHO
'AVG RANK RMSERHO'
(+/⍉RANKRHO)÷⍴RHOS
' '
'ADJUSTED R*2'
AAR
' '
'RANK ADJUSTED R*2'
RANKAAR
'AVG RANK ADJ R*2'
(+/⍉RANKAAR)÷⍴RHOS
' '
'EFFICIENCY'
EFF
' '
'RANK EFFICIENCY'
RANKEFF
'AVG RANK EFF.'
(+/⍉RANKEFF)÷⍴RHOS
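The summary measures at the end of SIM reduce to a few lines of arithmetic. The Python below is a hedged sketch of that step only: it turns accumulated sums of squared errors into RMSE values, forms the efficiency of each EGLS estimator as the OLS RMSE divided by the EGLS RMSE, and ranks the results. The function names are illustrative, and the convention that rank 1 goes to the largest efficiency is an assumption.

```python
import math


def rmse(sum_sq_error, reps):
    """Root mean squared error from an accumulated sum of squared errors."""
    return math.sqrt(sum_sq_error / reps)


def relative_efficiency(sse_ols, sse_egls, reps):
    """RMSE of the OLS slope divided by the RMSE of each EGLS slope."""
    base = rmse(sse_ols, reps)
    return [base / rmse(s, reps) for s in sse_egls]


def rank_descending(values):
    """Assign rank 1 to the largest value, rank 2 to the next, and so on."""
    order = sorted(range(len(values)), key=lambda i: -values[i])
    ranks = [0] * len(values)
    for position, index in enumerate(order, start=1):
        ranks[index] = position
    return ranks


# Hypothetical sums of squared slope errors over 100 replications.
eff = relative_efficiency(400.0, [100.0, 400.0, 1600.0], 100)
print(eff, rank_descending(eff))
```

An efficiency above 1 means the EGLS slope estimate had a smaller RMSE than the OLS slope estimate for that estimator.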


APL FUNCTION LAGS 


RHO4 LAGS XX
⍝USED IN THE MONTE CARLO SIMULATION. IT IS CALLED
⍝TO COMPUTE THE NINE ESTIMATORS OF ρ4 AND TO
⍝PERFORM THE EGLS REGRESSION AND CALCULATE
⍝THE MOES FOR EACH ESTIMATOR. THE INPUT RHO4 IS THE
⍝VALUE OF ρ4 THAT THE SIMULATION IS BEING PERFORMED
⍝FOR. THE INPUT XX IS A VECTOR OF THE RESIDUALS.
I←1
J←⍴XX
L←J+1
K←J-4
AA←EE←⍳K
⍝DETERMINE THE UNLAGGED AND LAGGED RESIDUALS.
LOOP:ID←I+4
AA[I]←XX[ID]
EE[I]←XX[ID-4]
I←I+1
→LOOP×⍳I≤K
T4←(⍴XX)-4
T5←T4-1
A←+/XX*2
A2←+/((0,0,0,0,(T5⍴1),0)/XX)*2
D4←(÷A)×+/(((0,0,0,0,T4⍴1)/XX)-((T4⍴1),0,0,0,0)/XX)*2
⍝CALCULATE THE SEVEN ESTIMATORS OF ρ4
R4←(÷A)×+/(((0,0,0,0,T4⍴1)/XX)×((T4⍴1),0,0,0,0)/XX)
R4STAR←((J-2)×R4)÷(J-1)
R4L←(÷A2)×+/(((0,0,0,0,T4⍴1)/XX)×((T4⍴1),0,0,0,0)/XX)
P4←1-(0.5×D4)
P4STAR←(((J*2)×P4)+4)÷((J*2)-4)
PLS←AA⌹EE
⍝CALL THE FUNCTIONS DURBIN, PWIT, AND MAX TO
⍝CALCULATE THE DURBIN, ITERATIVE PRAIS-WINSTEN,
⍝AND MAXIMUM LIKELIHOOD ESTIMATORS
DURBIN
PWIT
→MM×⍳(PWITF=0)
PWITFAIL←PWITFAIL+1
→FF×⍳(PWITF=1)
MM:REPPW←REPPW+(NPW-1)
FF:MAX
→AA×⍳(MAXF=0)
AA:REPMAX←REPMAX+(NML-1)
⍝CALCULATE THE MSE OF ρ4 FOR EACH ESTIMATOR
MSER4←MSER4+((R4-RHO4)*2)
MSER4S←MSER4S+((R4STAR-RHO4)*2)
MSER4L←MSER4L+((R4L-RHO4)*2)
MSEP4←MSEP4+((P4-RHO4)*2)
MSEP4S←MSEP4S+((P4STAR-RHO4)*2)
MSEPLS←MSEPLS+((PLS-RHO4)*2)
MSED←MSED+((PDURBIN-RHO4)*2)
→MS×⍳(PWITF=1)
MSEPW←MSEPW+((PW-RHO4)*2)
MS:MSEPPML←MSEPPML+((PPML-RHO4)*2)
⍝PERFORM THE EGLS REGRESSION AND CALCULATE MSE BETA
⍝AND ADJUSTED R-SQUARED FOR EACH OF THE ESTIMATORS
CALL1 R4
MSEBR4←MSEBR4+((BER[2]-B[2])*2)
AAR2R4←AAR2R4+ADJR2
CALL1 R4STAR
MSEBR4S←MSEBR4S+((BER[2]-B[2])*2)
AAR2R4S←AAR2R4S+ADJR2
CALL1 R4L
MSEBR4L←MSEBR4L+((BER[2]-B[2])*2)
AAR2R4L←AAR2R4L+ADJR2
CALL1 P4
MSEBP4←MSEBP4+((BER[2]-B[2])*2)
AAR2P4←AAR2P4+ADJR2
CALL1 P4STAR
MSEBP4S←MSEBP4S+((BER[2]-B[2])*2)
AAR2P4S←AAR2P4S+ADJR2
CALL1 PLS
MSEBPLS←MSEBPLS+((BER[2]-B[2])*2)
AAR2PLS←AAR2PLS+ADJR2
CALL1 PDURBIN
MSEBD←MSEBD+((BER[2]-B[2])*2)
AAR2D←AAR2D+ADJR2
→CC×⍳(PWITF=1)
CALL1 PW
MSEBPW←MSEBPW+((BER[2]-B[2])*2)
AAR2PW←AAR2PW+ADJR2
CC:CALL1 PPML
MSEBPPML←MSEBPPML+((BER[2]-B[2])*2)
AAR2PPML←AAR2PPML+ADJR2


APL FUNCTION TRANS 


P TRANS V
⍝THIS FUNCTION TRANSFORMS THE RAW DATA FOR EITHER
⍝AN AR(1) OR AN SAR(4) PROCESS. THE INPUT P IS A
⍝VECTOR OF LENGTH 2 WHERE P[1] IS THE ESTIMATE OF ρ1
⍝AND P[2] IS THE ESTIMATE OF ρ4. THE INPUT V IS THE
⍝VARIABLE YOU WANT TO BE TRANSFORMED. V CAN BE
⍝EITHER A VECTOR OF LENGTH N OR A N×1 MATRIX.
DIM←⍴⍴V
→VECTOR×⍳DIM=1
⍝RESHAPE V INTO A VECTOR IF IT WAS ENTERED
⍝AS A N×1 MATRIX
CHECK←⍴V
CHECK1←CHECK[1]
V←CHECK1⍴V
VECTOR:P1←P[1]
P2←P[2]
→NEXT×⍳P2>0
⍝LOOP TO BE PERFORMED FOR THE AR(1) TRANSFORMATION
I1←(⍴V)-1
V1←(0,I1⍴1)/V
VP1←((I1⍴1),0)/V
VV1←V[1]×((1-(P1*2))*0.5)
VV2←V1-(P1×VP1)
VV←VV1,VV2
→END×⍳DIM=1
⍝RESHAPE THE VARIABLE INTO A N×1 MATRIX IF IT
⍝WAS ENTERED AS SUCH
VV←(CHECK1,1)⍴VV
→END
⍝LOOP TO BE PERFORMED FOR THE AR(4) TRANSFORMATION
NEXT:I4←(⍴V)-4
V1←(0,0,0,0,I4⍴1)/V
VP4←((I4⍴1),0,0,0,0)/V
⍝CHECK IF THE ESTIMATOR IS < 1.0 AND IF NOT:
→OK×⍳P2<1
⍝ 1) SET THE ESTIMATOR TO .99999
P2←0.99999
⍝SAR(4) TRANSFORMATION COMPUTATION
OK:VP1234←4↑V
VV1234←VP1234×((1-(P2*2))*0.5)
VV5←V1-(P2×VP4)
VV←VV1234,VV5
→END×⍳DIM=1
⍝RESHAPE THE VARIABLE INTO A N×1 MATRIX IF IT
⍝WAS ENTERED AS SUCH
VV←(CHECK1,1)⍴VV
END:
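For readers less familiar with APL, the transformation performed by TRANS can be sketched in Python as follows. The sketch assumes a plain list as input and follows the logic as read from the listing: a positive fourth-order estimate selects the SAR(4) branch (with estimates of 1 or more reset to .99999), otherwise the AR(1) branch is used. The function name and argument order are illustrative assumptions.

```python
import math


def transform(p1, p4, v):
    """Transform v for AR(1) (when p4 <= 0) or SAR(4) (when p4 > 0)."""
    if p4 > 0:
        rho = p4 if p4 < 1 else 0.99999     # keep the estimate below 1
        lag = 4
    else:
        rho, lag = p1, 1
    scale = math.sqrt(1.0 - rho ** 2)
    # The first `lag` observations are rescaled rather than differenced.
    head = [scale * value for value in v[:lag]]
    # Every later observation is quasi-differenced against its lagged value.
    tail = [v[t] - rho * v[t - lag] for t in range(lag, len(v))]
    return head + tail


# Hypothetical usage: the same transform is applied to both Y and X.
print(transform(0.0, 0.5, [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]))
```

Rescaling the leading observations instead of dropping them keeps the sample size unchanged, which matters with short quarterly series.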


APL FUNCTION DURBIN 


DURBIN
⍝THIS FUNCTION CALCULATES THE DURBIN ESTIMATOR OF ρ4.
DIMD←⍴YCURRENT
T4D←DIMD-4
YDURBIN←(0,0,0,0,T4D⍴1)/YCURRENT
XDURBIN←(T4D,4)⍴0
XDURBIN[;1]←1
XDURBIN[;2]←((T4D⍴1),0,0,0,0)/YCURRENT
XCT←DIMD⍴XCURRENT
XDURBIN[;3]←(0,0,0,0,T4D⍴1)/XCT
XDURBIN[;4]←¯1×((T4D⍴1),0,0,0,0)/XCT
COE←YDURBIN⌹XDURBIN
PDURBIN←COE[2]


APL FUNCTION PWIT 


PWIT
⍝THIS FUNCTION COMPUTES THE ITERATIVE PRAIS-WINSTEN
⍝ESTIMATOR.
TP←⍴YCURRENT
TPW←TP-4
CHE←2
PWITF←NPW←0
XPW←XCURRENT
YPW←YCURRENT
XPWT1←XPW1←(TP,1)⍴1
ITER:UHPW←(YCURRENT-(YHPW←(1,XCURRENT)+.×(BEPW←YPW⌹(XPW←(XPWT1,XPW)))))
DEN←+/((0,0,0,0,((TPW-1)⍴1),0)/UHPW)*2
PW←(÷DEN)×+/(((0,0,0,0,TPW⍴1)/UHPW)×((TPW⍴1),0,0,0,0)/UHPW)
P←0,PW
⍝TRANSFORM X AND Y
P TRANS XCURRENT
XPW←VV
P TRANS YCURRENT
YPW←VV
⍝IF P>1 SET TO .99999
→CONPW×⍳(PW<1)
PW←0.99999
CONPW:XPWT1←XPW1×((TP,1)⍴(4⍴((1-PW*2)*0.5)),((TP-4)⍴(1-PW)))
NPW←NPW+1
→DD×⍳(NPW<52)
PWITF←1
→0
DD:DELTA←CHE-PW
CHE←PW
⍝PERFORM ANOTHER ITERATION IF THE ABSOLUTE
⍝DIFFERENCE BETWEEN THE LAST TWO ESTIMATIONS
⍝IS > .00001.
→ITER×⍳(|DELTA)>0.00001


APL FUNCTION MAX 


MAX
⍝THIS FUNCTION COMPUTES THE MAXIMUM LIKELIHOOD
⍝ESTIMATOR OF ρ4 USING THE ALGORITHM DERIVED
⍝BY BEACH AND MACKINNON.
NPML←⍴YCURRENT
CH←2
T4ML←NPML-4
XML←XCURRENT
YML←YCURRENT
XMLT1←XML1←(NPML,1)⍴1
MAXF←NML←0
⍝COMPUTE THE RESIDUALS FOR THE REGRESSION OF Y ON X.
ML:UH←(YCURRENT-(YHML←(1,XCURRENT)+.×(BEML←YML⌹(XML←(XMLT1,XML)))))
A4←(1,1,1,1,(T4ML⍴0))/UH
AT4←((T4ML⍴1),0,0,0,0)/UH
AT←(0,0,0,0,T4ML⍴1)/UH
DENOM←(NPML-1)×((+/(AT4*2))-(+/(A4*2)))
⍝COMPUTE THE COEFFICIENTS OF THE POLYNOMIAL
AML←(¯1×(NPML-2)×(+/(AT×AT4)))÷DENOM
BML←(((NPML-1)×(+/(A4*2)))-((NPML×(+/AT4*2))+(+/AT*2)))÷DENOM
CML←(NPML×(+/AT×AT4))÷DENOM
PMLP←BML-((AML*2)÷3)
QML←CML+(2×(AML*3)÷27)-(AML×BML÷3)
PHI1←((QML×(27*0.5))÷(2×PMLP×((¯1×PMLP)*0.5)))
PHI←¯2○PHI1
⍝COMPUTE THE ESTIMATOR
PPML←(¯2×(((¯1×PMLP)÷3)*0.5)×(2○((PHI÷3)+○(÷3))))-(AML÷3)
PPPML←0,PPML
⍝CALL THE FUNCTION TRANS TO TRANSFORM THE RAW DATA
PPPML TRANS XCURRENT
XML←VV
PPPML TRANS YCURRENT
YML←VV
⍝IF P>1 SET TO .99999
→CONMAX×⍳(PPML<1)
PPML←0.99999
CONMAX:XMLT1←XML1×((NPML,1)⍴(4⍴((1-PPML*2)*0.5)),((NPML-4)⍴(1-PPML)))
NML←NML+1
→MAF×⍳(NML≤52)
MAXF←1
→0
MAF:DELT←CH-PPML
CH←PPML
⍝IF THE ABSOLUTE DIFFERENCE BETWEEN THE LAST TWO
⍝ESTIMATES IS > .00001 THEN PERFORM ANOTHER
⍝ITERATION
→ML×⍳(|DELT)>0.00001


APL FUNCTION CALL1 


CALL1 EST
⍝THIS FUNCTION CALLS THE TRANSFORMATION FUNCTION
⍝TRANS FOR AN ESTIMATOR OF ρ4 AND THEN CALLS THE
⍝FUNCTION RREGRESS AND PERFORMS AN EGLS REGRESSION
⍝ON THE TRANSFORMED DATA.
PEST←0,EST
PEST TRANS XCURRENT
XT←VV
PEST TRANS YCURRENT
YT←VV
YT RREGRESS XT


APL FUNCTION RREGRESS 


Y RREGRESS X;MS;SS
⍝THIS IS A CONDENSED VERSION OF REGRESS LESS
⍝THE PRINTED OUTPUT TO BE USED IN CONJUNCTION WITH
⍝THE SIMULATION SIM.
NP←⍴X
K←NP[2]
SS←3⍴0
MS←((+/Y),MS←+⌿X)÷NP[1]
SS[2]←(SS[1]←+/Y*2)-NP[1]×MS[1]*2
UH←(Y-(YH←X+.×(BER←Y⌹(X←(1,X)))))
SS[3]←+/UH*2
R2←1-SS[3]÷SS[2]
NMINUSK←NP[1]-(K+1)
ADJR2←R2-((K÷NMINUSK)×(1-R2))
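The quantities RREGRESS returns can be illustrated for the single-regressor case with the Python sketch below. It assumes one explanatory variable plus a constant (K = 1), whereas the APL function accepts a general regressor matrix, and it uses the same adjusted R-squared formula, R² - (K / (N - K - 1))(1 - R²). The function name and the sample data are hypothetical.

```python
def simple_ols(y, x):
    """Fit y = a + b*x by least squares; return (a, b, r2, adj_r2)."""
    n = len(y)
    k = 1                                    # one regressor besides the constant
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b = sxy / sxx                            # slope estimate
    a = y_bar - b * x_bar                    # intercept estimate
    ss_total = sum((yi - y_bar) ** 2 for yi in y)
    ss_resid = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))
    r2 = 1.0 - ss_resid / ss_total
    adj_r2 = r2 - (k / (n - (k + 1))) * (1.0 - r2)
    return a, b, r2, adj_r2


# Hypothetical data, not taken from any contractor.
print(simple_ols([2.0, 4.0, 5.0, 7.0], [1.0, 2.0, 3.0, 4.0]))
```

In the simulation only the slope and the adjusted R-squared are accumulated across replications, so a condensed routine of this kind is all that is needed there.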


APL FUNCTION REGRESS 


Y REGRESS X;MS;D;SS
⍝THIS FUNCTION PERFORMS AN OLS REGRESSION OF Y ON X.
⍝THE INPUT X IS THE INDEPENDENT VARIABLES AND IS
⍝ASSUMED TO BE AN N×K MATRIX. THE INPUT Y IS THE
⍝DEPENDENT VARIABLE AND IS ASSUMED TO BE A VECTOR
⍝OF LENGTH N. A DIAGNOSTIC IS PRINTED IF N≤K OR IF
⍝RANK X<K. THE CONSTANT TERM IS ADDED BY THE PROGRAM.
⍝THIS FUNCTION WAS OBTAINED FROM RAMSEY AND
⍝MUSGRAVE'S APL-STAT (SEE REFERENCES).
FLAG←1
NP←⍴X
K←NP[2]
SS←5⍴0
→PREMEND1×⍳(NP[1]≤NP[2]+1)
CM←(MM←(⍉X)+.×X)-(MS∘.×MS←+⌿X)÷NP[1]
CRM←D+.×CM+.×D←(((⍳K)∘.=⍳K)×CM)*0.5
MS←((+/Y),MS)÷NP[1]
SS[2]←(SS[1]←+/Y*2)-NP[1]×MS[1]*2
MAIN:UH←(Y-(YH←X+.×(BE←Y⌹(X←(XCONY,X)))))
SS[4]←(SS[3]←+/YH*2)-NP[1]×MS[1]*2
SS[5]←+/UH*2
RSQ←1-SS[5]÷SS[2]
NMINUSK←NP[1]-(K+1)
ADJRSQ←RSQ-((K÷NMINUSK)×(1-RSQ))
VARU←SS[5]÷(-/NP)-1
STDERR←VARU*0.5
F1←SS[4]÷SS[5]×K÷(-/NP)-1
F2←SS[3]÷SS[5]×(K+1)÷(-/NP)-1
COVBE←VARU×⌹(⍉X)+.×X
STDBE←(1 1⍉COVBE)*0.5
TRATIO←BE÷STDBE
→CONT
PREMEND:FLAG←0
CONT:→MAINEND×⍳FLAG
'ROUTINE ENDED DUE TO SINGULARITY OF X MATRIX'
→0
MAINEND:'THE COEFFICIENT ESTIMATES ARE, CONST.,X1:'
BE
'THE CORRESPONDING STANDARD ERRORS ARE:'
STDBE
'THE CORRESPONDING T RATIOS ARE:'
TRATIO
'WITH DEGREES OF FREEDOM:'
NMINUSK
'RSQ IS: ',⍕RSQ
'ADJUSTED RSQ IS: ',⍕ADJRSQ
'STANDARD ERROR OF REGRESSION IS: ',⍕STDERR
'VAR OF ERROR TERM IS: ',⍕VARU
'THE F STATISTIC INCLUDING THE CONSTANT TERM IS:'
F2
'WITH DEGREES OF FREEDOM:'
(K+1),(¯1+-/NP)
'THE F STATISTIC NOT INCLUDING CONSTANT TERM IS:'
F1
'WITH DEGREES OF FREEDOM:'
K,(¯1+-/NP)
'THIS ENDS THE OUTPUT FROM REGRESS'
→0
PREMEND1:'NO. OF OBS (N) IS TOO FEW RELATIVE THE '
'NO. OF REGRESSORS (K).'
'ROUTINE TERMINATED'


APL FUNCTION LAG 


LAG XX
⍝GIVEN THE RESIDUALS THIS FUNCTION CALCULATES THE
⍝DURBIN-WATSON AND WALLIS TEST STATISTICS, THE
⍝SINGLE ESTIMATOR OF ρ1, AND THREE PREFERRED
⍝ESTIMATORS OF ρ4 (FROM THE SIMULATION). THE INPUT
⍝XX IS A VECTOR OF LENGTH N OF THE REGRESSION
⍝RESIDUALS.
J←⍴XX
T1←J-1
T2←T1-1
T4←J-4
A←+/XX*2
A1←+/((0,(T2⍴1),0)/XX)*2
⍝CALCULATE THE TWO TEST STATISTICS
D1←(÷A)×+/(((0,T1⍴1)/XX)-((T1⍴1),0)/XX)*2
D4←(÷A)×+/(((0,0,0,0,T4⍴1)/XX)-((T4⍴1),0,0,0,0)/XX)*2
⍝CALCULATE THE ESTIMATOR OF ρ1
PP←(÷A1)×+/(((0,T1⍴1)/XX)×((T1⍴1),0)/XX)
⍝CALCULATE THE THREE ESTIMATORS OF ρ4
⍝ESTIMATOR NUMBER THREE IS
P3←1-(0.5×D4)
⍝ESTIMATOR NUMBER FOUR IS
P4←(((J*2)×P3)+4)÷((J*2)-4)
⍝ESTIMATOR NUMBER NINE IS THE MAXIMUM LIKELIHOOD
⍝ESTIMATOR
MAX
'D1 = ',⍕D1
'DEGREES OF FREEDOM = ',⍕T1
'PRAIS-WINSTEN 1ST ORDER ESTIMATE, P1 = ',⍕PP
'D4 = ',⍕D4
'DEGREES OF FREEDOM = ',⍕T4
'DURBIN-WATSON 4TH ORDER ESTIMATE, P3 = ',⍕P3
'D-W THEIL-NAGAR MOD. ESTIMATE, P4 = ',⍕P4
'THE MAX LIKE ESTIMATOR IS, P9 = ',⍕PPML


APL FUNCTION CHECKER 


CHECKER RESL
⍝THIS FUNCTION COMPUTES THE DURBIN-WATSON AND
⍝WALLIS TEST STATISTICS ON THE RESIDUALS OF A
⍝REGRESSION. THE INPUT RESL IS A VECTOR OF LENGTH
⍝N OF RESIDUALS.
⍝DETERMINE DEGREES OF FREEDOM
TC1←(⍴RESL)-1
TC4←(⍴RESL)-4
AC←+/RESL*2
⍝DETERMINE TEST STATISTICS
D1C←(÷AC)×+/(((0,TC1⍴1)/RESL)-((TC1⍴1),0)/RESL)*2
D4C←(÷AC)×+/(((0,0,0,0,TC4⍴1)/RESL)-((TC4⍴1),0,0,0,0)/RESL)*2
' '
'AFTER TRANSFORMATION'
'D1 = ',⍕D1C
'DEGREES OF FREEDOM = ',⍕TC1
'D4 = ',⍕D4C
'DEGREES OF FREEDOM = ',⍕TC4


APL FUNCTION VARTEST


ALPHA VARTEST S
⍝THIS FUNCTION PERFORMS THE F-TEST TO CHECK THE
⍝HOMOGENEITY OF THE VARIANCE OF THE RESIDUALS.
NO←0.5×⍴S
SA←SB←NO⍴0
DOF←NO-1
D←2⍴DOF
⍝SEPARATE THE RESIDUALS INTO TWO GROUPS.
SA←NO↑S
SB←NO↓S
ABAR←(+/SA)÷NO
BBAR←(+/SB)÷NO
⍝CALCULATE THE VARIANCES OF THE TWO GROUPS.
ASOS←+/(SA-ABAR)*2
BSOS←+/(SB-BBAR)*2
⍝CALCULATE THE TEST STATISTIC R AND THE CRITICAL
⍝REGION.
R←(ASOS⌈BSOS)÷(ASOS⌊BSOS)
K1←D FQUAN(ALPHA÷2)
K2←D FQUAN(1-(ALPHA÷2))
⍝DETERMINE THE LEVEL AT WHICH THE TEST STATISTIC IS
⍝SIGNIFICANT.
ALPHAC←2×(1-(D FCENT R))
'FOR AN F TEST WITH ALPHA = ',⍕ALPHA
⍝DETERMINE IF THE TEST STATISTIC R FALLS WITHIN
⍝THE CRITICAL REGION.
→A×⍳((R>K1)∧(R<K2))
'REJECT Ho: VAR1 = VAR2'
→NEXT
A:'ACCEPT Ho: VAR1 = VAR2'
NEXT:'F STATISTIC = ',⍕R
'OBSERVED LEVEL OF SIGNIF. IS ALPHA = ',⍕ALPHAC
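The test statistic formed in VARTEST can be sketched in Python as shown below. The sketch covers only the statistic and its degrees of freedom: the residuals are split into a first and a second half, the sum of squared deviations about each half's mean is computed, and the larger sum is divided by the smaller. Looking up the F quantiles (done by FQUAN and FCENT in the listing) is left out because the Python standard library has no F distribution. The function name and the even-length requirement are assumptions.

```python
def variance_ratio(residuals):
    """Split residuals in half; return (larger SS / smaller SS, dof per half)."""
    if len(residuals) % 2 != 0:
        raise ValueError("expected an even number of residuals")
    half = len(residuals) // 2
    groups = (residuals[:half], residuals[half:])
    sums_of_squares = []
    for group in groups:
        mean = sum(group) / half
        sums_of_squares.append(sum((value - mean) ** 2 for value in group))
    ratio = max(sums_of_squares) / min(sums_of_squares)
    return ratio, half - 1


# Hypothetical residuals whose spread doubles in the second half.
print(variance_ratio([1.0, -1.0, 1.0, -1.0, 2.0, -2.0, 2.0, -2.0]))
```

Because both halves have the same number of observations, the ratio of sums of squares equals the ratio of sample variances, so it can be compared directly with F critical values on (half - 1, half - 1) degrees of freedom.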
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